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ABSTRACT 

We present a comprehensive analysis of strong-lensing, weak-lensing shear and magnification data for a sam¬ 
ple of 16 X-ray-regular and 4 high-magnification galaxy clusters at 0.19 <, z ^ 0.69 selected from the Cluster 
Lensing And Supernova survey with Hubble (CLASH). Our analysis combines constraints from 16-band Hub¬ 
ble Space Telescope observations and wide-field multi-color imaging taken primarily with Suprime-Cam on 
the Subaru Telescope, spanning a wide range of cluster radii (10"-16')- We reconstruct surface mass density 
profiles of individual clusters from a joint analysis of the full lensing constraints, and determine masses and 
concentrations for all clusters. We find internal consistency of the ensemble mass calibration to be < 5% ± 6% 
in the one-halo regime (200-2000 kpc h~ l ) by comparison with the CLASH weak-lensing-only measurements 
of Umetsu et al. For the X-ray-selected subsample of 16 clusters, we examine the concentration-mass (c-M) 
relation and its intrinsic scatter using a Bayesian regression approach. Our model yields a mean concentration 
of c|2—0.34 = 3.95 ± 0.35 at M200C — 14 x 10 14 Mq and an intrinsic scatter of er(lnc2ooc) = 0.13 ± 0.06, 
in excellent agreement with A cold dark matter predictions when the CLASH selection function based on 
X-ray morphological regularity and the projection effects are taken into account. We also derive an ensemble- 
averaged surface mass density profile for the X-ray-selected subsample by stacking their individual profiles. 

The stacked lensing signal is detected at 33er significance over the entire radial range < 4000 kpc h~ l , ac¬ 
counting for the effects of intrinsic profile variations and uncorrelated large-scale structure along the line of 
sight. The stacked mass profile is well described by a family of density profiles predicted for cuspy dark- 
matter-dominated halos in gravitational equilibrium, namely, the Navarro-Frenk-White (NFW), Einasto, and 
DARKexp models, whereas the single power-law, cored isothermal and Burkert density profiles are disfavored 
by the data. We show that cuspy halo models that include the large-scale two-halo term provide improved 
agreement with the data. For the NFW halo model, we measure a mean concentration of C200C = 3.79^Q2g at 
M 200 C = 14.lii Q x 1O 14 M0, demonstrating consistency between complementary analysis methods. 

Keywords: cosmology: observations — dark matter — galaxies: clusters: general — gravitational lensing: 
strong — gravitational lensing: weak 


1. INTRODUCTION 

Clusters of galaxies represent the largest and rarest class of 
self-gravitating systems formed in the universe. In the con¬ 
text of hierarchical models of structure formation, the evo¬ 
lution of the cluster abundance with cosmic time is a sensi¬ 
tive probe of the amplitude and growth rate of the primor¬ 
dial fluctuation spectrum because cluster-sized systems popu- 
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late the exponential tail of the cosmic mass function of dark- 
matter (DM, hereafter) halos (Haiman et al. 2001). There¬ 
fore, large cluster samples with well-defined selection func¬ 
tions can, in principle, provide an independent test of any vi¬ 
able cosmological model, including the current concordance 
A cold dark matter (ACDM) model defined in the framework 
of general relativity, complementing cosmic microwave back¬ 
ground (CMB) anisotropy, large-scale galaxy clustering, su¬ 
pernova, CMB lensing and cosmic shear experiments. Cur¬ 
rently, cluster samples are often defined by optical. X-ray, or 
Sunyaev-Zel’dovich effect (SZE) observables (e.g., Vikhlinin 
et al. 2009; Planck Collaboration et al. 2014), so that the 
masses are indirectly inferred from scaling relations. In the 
last few years, a systematic effort has been conducted to en¬ 
able a self-consistent calibration of mass-observable relations 
using robust cluster lensing measurements (von der Linden 
et al. 2014a; Gruen et al. 2014; Umetsu et al. 2014; Ford et al. 
2014; Melchior et al. 2015; Jimeno et al. 2015; Hoekstra et al. 
2015; Merten et al. 2015) and well-defined selection functions 
(Benitez et al. 2014; Miyazaki et al. 2015). 

Considerable progress has been made in understanding 
the formation and evolution of DM halos in an expand¬ 
ing universe, governed by nonlinear gravitational growth of 
cosmic density perturbations. iY-body simulations of col¬ 
lisionless CDM established a nearly self-similar form for 
the spherically-averaged density profile Ph(r) of DM halos 
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(Navarro et al. 1997, hereafter Navarro-Frenk-White, NFW) 
over a wide range of halo masses and radii, with some in¬ 
trinsic variance associated with the mass accretion histories 
of individual halos (Jing & Suto 2000; Tasitsiomi et al. 2004; 
Graham et al. 2006; Navarro et al. 2010; Ludlow et al. 2013; 
Diemer & Kravtsov 2014; Adhikari et al. 2014). The degree 
of mass concentration, C 200 C = T2 ooc/t-2> 12 is predicted to 
correlate with halo mass because the scale radius r _2 stays 
approximately constant when the halo leaves the fast accre¬ 
tion regime, whereas r 2 ooc does still grow, thus increasing 
concentration. Cluster-sized halos are thus predicted to be 
less concentrated than less massive halos and to have typical 
concentrations of c 2 oo c = 3-4 (Bhattacharya et al. 2013; Dut¬ 
ton & Maccio 2014; Meneghetti et al. 2014). 

Galaxy clusters act as powerful gravitational lenses (e.g., 
Kneib & Natarajan 2011), offering a direct probe for test¬ 
ing these well defined predictions of DM halo structure. The 
critical advantage of cluster gravitational lensing is its ability 
to map the mass distribution of individual systems indepen¬ 
dent of assumptions about their physical and dynamical state. 
Clusters produce a variety of detectable lensing effects, in¬ 
cluding deflection, shearing, and magnifying of the images of 
background sources (Bartelmann & Schneider 2001). In the 
weak regime where the lensing signal is approximately lin¬ 
early related to the potential, lensing can be used to probe 
the mass distribution of clusters in a model-independent man¬ 
ner (e.g., Kaiser & Squires 1993; Fahlman et al. 1994; Kaiser 
1995; Umetsu et al. 1999; Clowe et al. 2000). In the strong 
regime, several sets of multiple images with known redshifts 
allow us to tightly constrain the central mass distribution (e.g., 
Zitrin et al. 2012a; Jauzac et al. 2014). For a massive cluster, 
the two complementary regimes contribute similar logarith¬ 
mic radial coverage (Umetsu et al. 2011b, see their Figure 
6 ). Hence, combining strong and weak lensing can signifi¬ 
cantly improve constraints on the cluster mass distribution for 
full radial coverage (Bradac et al. 2006; Limousin et al. 2007; 
Merten et al. 2009; Diego et al. 2015). 

The Cluster Lensing And Supernova survey with Hubble 
(CLASH, Postman et al. 2012) 13 is a 524-orbit Multi-Cycle 
Treasury program that has been designed to probe the mass 
distribution of 25 galaxy clusters using their gravitational 
lensing properties, providing a sizable sample of mass cal¬ 
ibrators for accurate cluster cosmology. All CLASH clus¬ 
ters were observed in 16 filters with the Hubble Space Tele¬ 
scope (HST ). This 16-band HST photometry has enabled us 
to uncover many sets of multiple images (Zitrin et al. 2012a, 
2015), whose spectroscopic redshifts have been determined 
from a dedicated spectroscopic survey conducted with the VI- 
MOS spectrograph on the Very Large Telescope (VLT; Bi- 
viano et al. 2013; Balestra et al. 2013; Rosati et al. 2014; Gi- 
rardi et al. 2015). CLASH has produced combined strong- 
and weak-lensing analyses of HST and Subaru Telescope ob¬ 
servations, allowing for detailed mapping of the cluster mass 
distributions (Umetsu et al. 2012; Coe et al. 2012; Medezinski 
et al. 2013; Zitrin et al. 2015; Merten et al. 2015). Donahue 
et al. (2014) derived radial profiles of temperature, gas mass, 
and hydrostatic mass for the full CLASH sample using XMM- 
Newton and Chandra observations. 

A major goal of the CLASH survey is to test models of 

12 The quantity r 200 c is defined as the radius within which the mean in¬ 
terior density is 200 times the critical density p c (z) of the universe at the 
cluster redshift z, and r _2 is a scale radius at which d In ()\Jdlnr = —2. 

13 http://www.stsci.edu/ postman/CLASH/ 


structure formation by using the halo concentration-mass (c- 
M) relation determined from cluster gravitational lensing. 
For this aim, twenty CLASH clusters were selected to have 
X-ray temperatures greater than 5 keV and to show a smooth 
X-ray morphology, with no lensing information used a priori 
(Postman et al. 2012). A further sample of five clusters were 
selected by their high lens magnification properties to study 
magnified high-redshift galaxies behind the clusters (Zheng 
et al. 2012; Coe et al. 2013; Monna et al. 2014). 

Recently, we have carried out a systematic study of the 
CLASH sample to obtain measurements of mass and con¬ 
centration from cluster lensing (Meneghetti et al. 2014; 
Umetsu et al. 2014; Merten et al. 2015). Meneghetti et al. 
(2014) presented a detailed characterization of the CLASH 
X-ray-selected clusters with numerical simulations to make 
predictions about their intrinsic and observational proper¬ 
ties. Umetsu et al. (2014) conducted a joint shear-and- 
magnification weak-lensing analysis of a subsample of the 
CLASH clusters, using wide-field multi-color imaging taken 
primarily with Subaru/Suprime-Cam. Merten et al. (2015) 
presented a two-dimensional strong- and weak-lensing (here¬ 
after SaWLens) analysis of 19 CLASH X-ray-selected clus¬ 
ters, by combining strong and weak-shear lensing data from 
16-band HST imaging with wide-field weak-shear data ana¬ 
lyzed by Umetsu et al. (2014). In both analyses, we find ex¬ 
cellent agreement between the data and ACDM predictions 
when the projection effects and the selection function based 
on X-ray morphology are taken into account. More recently, 
Xu et al. (2016) carried out an observational and theoretical 
study of the abundance of gravitationally lensed arcs in the 
CLASH survey, finding full agreement between the observa¬ 
tions and simulations using an automated, objective arcfind¬ 
ing algorithm. 

In this paper we present a comprehensive joint analysis of 
strong-lensing, weak-lensing shear and magnification data for 
a sample of 16 X-ray-regular and 4 high-magnification clus¬ 
ters selected from the CLASH survey. Our extended anal¬ 
ysis combines the constraints from the CLASH Subam and 
HST lensing data sets of Umetsu et al. (2014) and Zitrin et al. 
(2015). We aim at combining these complementary lens¬ 
ing constraints to construct individual cluster surface mass 
density profiles, from which to determine the c-M relation. 
This improved mass-profile data set also allows us to obtain 
an ensemble calibration of cluster masses and to probe the 
ensemble-averaged cluster mass distribution. 

The paper is organized as follows. In Section 2 we briefly 
describe the basic theory of cluster gravitational lensing. Af¬ 
ter summarizing the properties of the CLASH sample, we out¬ 
line in Section 3 the formalism and procedure for constructing 
surface mass density profiles from a joint analysis of strong- 
lensing, weak-lensing shear and magnification constraints. In 
Section 4 we revisit the mass profile analysis of individual 
CLASH clusters by combining HST and ground-based lens¬ 
ing measurements. In Section 5, we conduct stacked lens¬ 
ing analysis of the X-ray-selected subsample to study their 
ensemble-averaged mass distribution. In Section 6 we ex¬ 
amine the concentration-mass relation for the X-ray-selected 
subsample using Bayesian regression methods. Section 7 is 
devoted to the discussion of the results. Finally, a summary is 
given in Section 8. 

Throughout this paper, we adopt a concordance ACDM 
cosmology with = 0.27, = 0.73, and h = 

O. 7/170 = 0.7 (Komatsu et al. 2011), where Hq = h x 
lOOkms -1 Mpc -1 = /170 x 70kms _1 Mpc -1 . We use the 


Umetsu et al. 
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standard notation Ma c (A/a,,, ) to denote the mass enclosed 
within a sphere of radius ta c (fA m X within which the mean 
overdensity is A c (A m ) times the critical density p c {z) (mean 
background density p m (z)) at the cluster redshift z. All 
quoted errors are 68.3% (lcr) confidence limits (CL) unless 
otherwise stated. 

2. CLUSTER LENSING BASICS 

In the cluster lensing regime (Umetsu 2010), the conver¬ 
gence k = E/E c is the projected surface mass density E(0) 
in units of the critical surface mass density for lensing, E c = 
{c 2 D s )/{A'kGD\D\ s ) = c 2 /( 47 tGDi/ 3 ), where D\, D s , and 
D\ s are the lens, source, and lens-source proper angular di¬ 
ameter distances, respectively, and j3(z, Z\) = D\ s /D s is the 
geometric lensing strength as a function of source redshift z 
and lens redshift z\. 

The gravitational shear 7 can be directly observed from el- 
lipticities of background galaxies in the regime where n <C 1 . 
The tangential shear component 7 + averaged around a circle 
of radius 9 satisfies the following identity (Kaiser 1995): 


(Bartelmann & Schneider 2001). The associated critical sur¬ 
face mass density is E c oo = c 2 /(47tG.Di/3 00 ). The source- 
averaged convergence and shear fields are then expressed as 
(k)x = {W) X K 00 and ( 7 )* = {W) x Too, using those in the 
far-background limit. Hereafter, we use the far-background 
lensing fields, K^iO) and 7 ^ (6), to describe the projected 
mass distribution of clusters. 

3. CLUSTER SAMPLE, DATA, AND METHODOLOGY 

In this section, we outline the analysis procedure used to 
combine strong-lensing, weak-lensing shear and magnifica¬ 
tion data for direct reconstruction of cluster surface mass den¬ 
sity profiles. In Section 3.1, we present a summary of the 
properties of our cluster sample. In Section 3.2, we describe 
the background galaxy selection for the weak-lensing shear 
and magnification analysis. In Sections 3.3, 3.4, and 3.5, we 
describe our methods for measuring cluster lensing profiles as 
a function of clustercentric radius. In Section 3.6 we outline 
the joint likelihood approach of Umetsu (2013) to perform a 
mass profile reconstruction from multi-probe lensing data. 


7 + (0) = k(<0)-k( 0) = AE(0)/E c , (1) 

with k{9) = E(0)/E c the azimuthally averaged convergence 
at radius 9, n(< 9) = E(< 9)/ E c the average convergence 
interior to 9, and AE(0) = E(< 9) — E(0) the differential 
surface mass density. 

The observable quantity for quadrupole weak lensing in 
general is not the shear but the reduced gravitational shear. 


g{0) 


7(3) 

1 — k(G) ’ 


( 2 ) 


which is invariant under k(0) —> A k(8) + 1 — A and 7 ( 0 ) —► 
Xy(0) with an arbitrary constant A / 0, known as the mass- 
sheet degeneracy (Falco et al. 1985; Gorenstein et al. 1988; 
Schneider & Seitz 1995). This degeneracy can be broken, for 
example , 14 by measuring the magnification factor, 

M6) = [1 — «( 0)] 2 — | 7 ( 0 ) l 2 ’ P) 


which transforms as p(0) —> A 2 p{9). 

We consider a population of source galaxies described by 
their redshift distribution function, N(z ), for statistical weak- 
lensing measurements. The mean lensing depth for a given 
population ( X = g , p) is given by 


(P)x = 


dz N x (z)/3(z ) 


dz N x {z) 


1 Go 


(4) 


In general, we apply different size, magnitude, and color cuts 
in source selection for measuring the shear and magnification 
effects, leading to different N x (z). In contrast to the former 
effect, the latter does not require source galaxies to be spa¬ 
tially resolved, but it does require a stringent flux limit against 
incompleteness effects. 

We introduce the relative lensing depth of a source pop¬ 
ulation with respect to a fiducial source in the far back¬ 
ground as (W) x = {/3} x //3 oo with ^ = /3(z -» 00 , 21 ) 


14 Alternatively, one may constrain the constant A such that the enclosed 
mass within a certain aperture is consistent with mass estimates from inde¬ 
pendent observations (e.g., Umetsu & Futamase 2000). See also Section 4. 1. 


3.1. Cluster Sample 

Our cluster sample stems from the CLASH shear-and- 
magnification weak-lensing analysis of Umetsu et al. (2014) 
based primarily on Subaru multi-color imaging. This sample 
comprises two subsamples, one with 16 X-ray regular clusters 
and another with four high-magnification clusters, both taken 
from the CLASH sample of Postman et al. (2012). 

Table 1 gives a summary of the properties of 20 clusters in 
our sample. Following Umetsu et al. (2014), we adopt the 
brightest cluster galaxy (BCG) position as the cluster center 
for our mass profile analysis. As discussed by Umetsu et al. 
(2014), our sample exhibits, on average, a small positional 
offset between the BCG and X-ray peak, characterized by an 
rms offset of cr 0 gf ~ 30kpc /i _1 . For the X-ray-selected sub¬ 
sample, cr 0 ff ~ llkpc/r -1 . This level of offset is negligi¬ 
ble compared to the range of overdensity radii of interest for 
mass measurements (A c /£ 2500). Hence, smoothing from 
the miscentering effects will not significantly affect our clus¬ 
ter mass profile measurements (Johnston et al. 2007; Umetsu 
et al. 2011 a). 

3.2. Background Galaxy Selection for Weak Lensing 

A careful selection of background galaxies is critical for a 
cluster weak-lensing analysis, so that unlensed cluster mem¬ 
bers and foreground galaxies do not dilute the background 
lensing signal. Umetsu et al. (2014) used the color-color (CC) 
selection method of Medezinski et al. (2010), typically using 
the Subaru/Suprim-Cam BjRqz' photometry where available 
(Umetsu et al. 2014, Tables 1 and 2), which spans the full op¬ 
tical wavelength range. The photometric zero points were pre¬ 
cisely calibrated to an accuracy of ~ 0.01 mag, using the HST 
photometry of cluster elliptical galaxies and with the help of 
galaxies with measured spectroscopic redshifts. 

For shape measurements, Umetsu et al. (2014) combined 
two distinct populations that encompass the red and blue 
branches of background galaxies in CC-magnitude space, 
having typical redshift distributions peaked around z ~ 1 
and ~ 2, respectively (see Figures 1, 5, and 6 of Medezin¬ 
ski et al. 2011). For validation purposes, we have compared 
our blue+red background samples with spectroscopic samples 
obtained from the CLASH-VLT large spectroscopic program 
with VIMOS (Rosati et al. 2014) providing thousands of spec- 
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Table 1 

Properties of the cluster sample 


Cluster 

21 

R.A. a 
(12000.0) 

decl. a 

(J2000.0) 

k B T x b 

(keV) 

0Ein C 

(") 

6 = 10" 

M 2D (10 13 

e = 20 " 

Mq h 7 g ) d 

e = 30 " 

II 

4^. 

O 

X-ray Selected: 

Abell 383 

0.187 

02:48:03.40 

-03:31:44.9 

6.5 ±0.24 

15.1 

1.15 ± 0.17 

3.04 ±0.51 

4.98 ±0.96 

6.77 ± 1.35 

Abell 209 

0.206 

01:31:52.54 

-13:36:40.4 

7.3 ± 0.54 

8.9 

0.93 ±0.14 

2.36 ±0.45 

3.96 ±0.89 

5.60 ± 1.40 

Abell 2261 

0.224 

17:22:27.18 

+32:07:57.3 

7.6 ±0.30 

23.1 

1.91 ± 0.31 

4.79 ± 0.69 

7.67 ± 1.26 

10.42 ± 1.84 

RXI2129.7+0005 

0.234 

21:29:39.96 

+00:05:21.2 

5.8 ±0.40 

12.9 

1.13 ±0.18 

3.36 ± 0.44 

5.95 ±0.74 

8.67 ± 1.10 

Abell 611 

0.288 

08:00:56.82 

+36:03:23.6 

7.9 ±0.35 

18.1 

1.84 ±0.25 

5.25 ± 0.83 

9.37 ± 1.82 

14.26 ±3.06 

MS2137-2353 

0.313 

21:40:15.17 

-23:39:40.2 

5.9 ±0.30 

17.1 

2.25 ± 0.33 

5.23 ± 0.81 

7.99 ± 1.38 

10.76 ±2.00 

RXI2248.7-4431 

0.348 

22:48:43.96 

-44:31:51.3 

12.4 ± 0.60 

31.1 

2.47 ± 0.45 

7.36 ± 1.02 

13.14 ± 1.96 

19.41 ± 3.07 

MACSI1115.9+0129 

0.352 

11:15:51.90 

+01:29:55.1 

8.0 ±0.40 

18.1 

1.85 ± 0.37 

5.79 ± 0.84 

11.09 ± 1.50 

16.98 ±2.37 

MACSI1931.8-2635 

0.352 

19:31:49.62 

-26:34:32.9 

6.7 ± 0.40 

22.2 

2.91 ± 0.60 

7.37 ± 1.12 

12.15 ± 1.86 

17.21 ± 2.90 

RXI1532.9+3021 

0.363 

15:32:53.78 

+30:20:59.4 

5.5 ±0.40 

— 

— 

— 

— 

— 

MACSI1720.3+3536 

0.391 

17:20:16.78 

+35:36:26.5 

6.6 ±0.40 

20.1 

2.65 ± 0.35 

7.20 ± 1.08 

12.39 ± 2.17 

17.97 ± 3.36 

MACSI0429.6-0253 

0.399 

04:29:36.05 

-02:53:06.1 

6.0 ±0.44 

15.7 

2.22 ± 0.39 

6.80 ± 0.96 

12.55 ± 1.86 

18.87 ± 3.02 

MACSI1206.2-0847 

0.440 

12:06:12.15 

-08:48:03.4 

10.8 ±0.60 

26.8 

3.37 ± 0.50 

9.51 ± 1.39 

16.37 ± 2.50 

23.18 ±3.88 

MACSI0329.7-0211 

0.450 

03:29:41.56 

-02:11:46.1 

8.0 ±0.50 

24.1 

3.40 ± 0.62 

8.66 ± 1.26 

14.36 ± 2.16 

21.12 ±3.17 

RXI1347.5-1145 

0.451 

13:47:31.05 

-11:45:12.6 

15.5 ±0.60 

33.0 

3.56 ±0.76 

11.55 ±2.14 

20.40 ±3.18 

29.25 ±4.12 

MACSI0744.9+3927 

0.686 

07:44:52.82 

+39:27:26.9 

8.9 ±0.80 

24.3 

4.70 ± 0.92 

13.55 ± 2.13 

23.64 ± 3.35 

34.79 ±4.57 

High Magnification: 

MAC SJ0416.1-2403 

0.396 

04:16:08.38 

-24:04:20.8 

7.5 ±0.80 

25.9 

1.33 ±0.18 

5.35 ± 0.79 

11.32 ± 1.68 

17.43 ±2.51 

MACSJ1149.5+2223 

0.544 

11:49:35.69 

+22:23:54.6 

8.7 ± 0.90 

20.4 

2.88 ±0.51 

9.29 ± 1.39 

17.80 ±2.87 

28.09 ± 5.07 

MACSJ0717.5+3745 

0.548 

07:17:32.63 

+37:44:59.7 

12.5 ±0.70 

55.0 

2.01 ± 0.19 

8.04 ± 0.74 

18.79 ± 2.01 

35.80 ±5.83 

MACS J0647.7+7015 

0.584 

06:47:50.27 

+70:14:55.0 

13.3 ± 1.80 

26.4 

3.62 ± 0.77 

11.75 ± 1.84 

21.59 ±3.32 

31.95 ± 5.45 


a The cluster center is taken to be the location of the BCG when a single dominant central galaxy is found. Otherwise, in the case of MACSJ0717.5+3745 and 
MACSJ0416.1—2403, it is defined as the center of the brightest red-sequence-selected cluster galaxies (Umetsu et al. 2014). 
k X-ray temperature from Postman et al. (2012). 

c Effective Einstein radius for a fiducial source at z s — 2 determined from the HST strong and weak-shear lensing analysis by Zitrin et al. (2015). The reported values are the 
average of two different models (where available) of Zitrin et al. (2015). The typical model uncertainty in #Ein is 10%. 

d Lensing estimates of the projected cluster mass M 21 } (< 9) enclosed within a cylinder of radius 9. The data here are constructed for each cluster by combining two different 
lens models (where available) of Zitrin et al. (2015). For details, see Section 3.5. 


troscopic redshifts for cluster members and intervening galax¬ 
ies along the line of sight, including lensed background galax¬ 
ies (e.g., Biviano et al. 2013; Balestra et al. 2013). Combining 
CLASH-VLT spectroscopic redshifts with the Subaru multi¬ 
band photometry available for 10 southern CLASH clusters, 
we find a mean contamination fraction of (2.4 ± 0.7)% in our 
blue+red background regions in CC-magnitude space, where 
the error accounts for Poisson statistics. 

Our magnification-bias measurements are based on flux- 
limited samples of red background galaxies (Rc z' % 0.5, 
Medezinski et al. 2011). Apparent faint magnitude cuts 
(wiim) were applied for each cluster in the reddest CC- 
selection band (typically the Subaru z' band) to avoid incom¬ 
pleteness near the detection limit. The threshold mi; m was 
chosen at the magnitude where the source number counts turn 
over, and it corresponds on average to the 6 <r limiting mag¬ 
nitude within 2" diameter aperture. Our CC selection does 
not cause incompleteness at the faint end in the bluer filters 
(for a general discussion, see Hildebrandt 2016) because we 
have deeper photometry in the bluer bands (this is by design, 
so as to detect the red galaxies; see Broadhurst 1995) and our 
“CC-red” galaxies are relatively blue in Bj — Rq (Figure 1 of 
Medezinski et al. 2011). 

The mean depths (/?) and (/3 2 ) of the background sam¬ 
ples were estimated using photometric redshifts of individ¬ 
ual galaxies determined with the BPZ code (Benitez 2000; 
Benitez et al. 2004) from our point-spread-function (PSF) cor¬ 
rected multi-band photometry (typically with 5 Subaru filters; 
Table 1 of Umetsu et al. 2014). An excellent statistical agree¬ 
ment was found between the depth estimates (/ 3 ) from our 
BPZ measurements in the cluster fields and those from the 
COSMOS photometric-redshift catalog (Ilbert et al. 2009), 
with a median relative offset of 0.27% and an rms field-to- 


field scatter of 5.0% (Umetsu et al. 2014). 

3.3. Reduced Tangential Shear 

We use the azimuthally averaged radial profile of the re¬ 
duced tangential shear g + = 7 + /(l — n) as the primary 
constraint from wide-field weak-lensing observations. We 
adopt the following approximation for the nonlinear correc¬ 
tions to the source-averaged reduced tangential shear ( g + ) = 

[J^°dz N g (z)g + (z)\ [J^°dz iV 9 (,z)] 1 (Seitz & Schneider 
1997): 

, \ _ (W)g [«oo(< 0) - «oo(fl)] = ( 7 +) 

[9+> ~ 1 - Koo (0)(W2) g /(W) g 1 - f w , g {n) 5 1; 

where {W) g is the relative lensing strength (Section 2) aver¬ 
aged over the population N g (z) of source galaxies, fw.g — 
(PU 2 ) S /(PU) 2 is a dimensionless quantity of the order unity, 

(k) = ( W) g K 00 , and ( 7 +) = (FF) g 7 ±,oo- 

In the present study, we use the weak lensing shear data ob¬ 
tained by Umetsu et al. (2014). The shear analysis pipeline 
of Umetsu et al. (2014) was implemented based on the pro¬ 
cedures described in Umetsu et al. (2010) and on verifica¬ 
tion tests with mock ground-based observations (Massey et al. 
2007; Oguri et al. 2012). The key feature of the shear calibra¬ 
tion method of Umetsu et al. (2010) is that we use galaxies 
detected with very high significance, v > 30, to model the 
PSF isotropic correction as function of object size and mag¬ 
nitude. Here v is the peak significance given by the IMCAT 
peak-finding algorithm hfindpeaks. Recently, a very similar 
procedure was used by the Local Cluster Substructure Survey 
(LoCuSS) collaboration in their Subaru weak-lensing study 
of 50 clusters (Okabe & Smith 2015). Another important 
feature is that we select isolated galaxies for the shape mea¬ 
surement to minimize the impact of crowding and blending 
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(Umetsu et al. 2014). To do this, we first identify objects 
having any detectable neighbor within 3 r g , with r g the Gaus¬ 
sian scale length given by hfindpeaks. All such close pairs 
of objects are rejected. After this close-pair rejection, objects 
with low detection significance v < 10 are excluded from 
our analysis. All galaxies with usable shape measurements 
are then matched with sources in our CC-magnitude-selected 
background galaxy samples (Section 3.2), ensuring that each 
galaxy is detected in both the reddest CC-selection band and 
the shape-measurement band. 

Using simulated Subaru/Suprime-Cam images (Massey 
et al. 2007; Oguri et al. 2012), Umetsu et al. (2010) find 
that the lensing signal can be recovered with \m\ ~ 0.05 
of the multiplicative shear calibration bias and c ~ 10 -3 of 
the residual shear offset (as defined by Heymans et al. 2006; 
Massey et al. 2007). Accordingly, Umetsu et al. (2014) in¬ 
cluded for each galaxy a shear calibration factor of 1/(1 + to) 
(g 9 / 0.95) to account for residual calibration. As noted 
by Umetsu et al. (2012), the degree of multiplicative bias to 
depends modestly on the seeing conditions and the PSF qual¬ 
ity (Oguri et al. 2012, see Section 3.2 for their image sim¬ 
ulations using Gaussian and Moffat PSF profiles with 0.5"- 
1.1"FWHM). This variation with the PSF properties limits 
the shear calibration accuracy to 5m ~ 0.05 (Umetsu et al. 
2012, Section 3.3). 

From shape measurements of background galaxies, the av¬ 
eraged reduced tangential shear was measured in a set of con¬ 
centric annuli (i = 1, 2,..., A\yi.) centered on each cluster as 


(s-m) 


9+(k) 

.kei 


.k£i 


-1 


( 6 ) 


where the index k runs over all objects located within the ith 
annulus, g+(k) is an estimate of g + for the fcth object, and w^) 
is its statistical weight given by = 1/(cr^ fc ) + a: 2 ), with 
<J g (k) the uncertainty in the estimate of reduced shear gi j.) and 
a g the softening constant taken to be a g = 0.4 (Umetsu et al. 
2014), a typical value of the mean dispersion (cr 2 ) 1 / 2 in Sub¬ 
aru observations (e.g., Umetsu et al. 2009; Oguri et al. 2009; 
Okabe et al. 2010). Here a includes both intrinsic shape and 
measurement noise contributions. The statistical uncertainty 
a in (g+'i) was estimated from bootstrap resampling of the 
background source catalog for each cluster. 

The reduced tangential shear profiles analyzed in this study 
are presented in Figure 2 of Umetsu et al. (2014). For all clus¬ 
ters in our sample, the estimated values for (/3) g , and fw,g are 
summarized in Table 3 of Umetsu et al. (2014). We marginal¬ 
ize over the calibration uncertainty of ( f3) g in our joint likeli¬ 
hood analysis of multiple lensing probes (Section 3.6.2). 


3.4. Magnification Bias 

A fundamental limitation of measuring shear only is the 
mass-sheet degeneracy (Section 2; see also Section 4.2). We 
can break this degeneracy by using the complementary com¬ 
bination of shear and magnification (Schneider et al. 2000; 
Umetsu & Broadhurst 2008; Rozo & Schmidt 2010; Umetsu 
2013). 

Deep multi-color photometry enables us to explore the faint 
end of the luminosity function of red quiescent galaxies at 
z ~ 1 (Ilbert et al. 2010). For such a population, the effect of 
magnification bias is dominated by the geometric area distor¬ 
tion because few fainter galaxies can be magnified up into the 


flux-limited sample; this results in a net depletion of source 
counts (Taylor et al. 1998; Broadhurst et al. 2005; Umetsu 
& Broadhurst 2008; Umetsu et al. 2010, 2011b, 2012, 2014, 
2015; Ford et al. 2012; Coe et al. 2012; Medezinski et al. 
2013; Radovich et al. 2015). In the regime of negative mag¬ 
nification bias, a practical advantage is that the effect is not 
sensitive to the exact form of the source luminosity function 
(Umetsu et al. 2014). 

If the magnitude shift 5m = 2.51og 10 /j, of an object due 
to magnification is small compared to that on which the log¬ 
arithmic slope of the luminosity function varies, the number 
counts can be locally approximated by a power law at the lim¬ 
iting magnitude mii m . The magnification bias at redshift z is 
then given by (Broadhurst et al. 1995) 

Np(0, z-, < m lim ) = N„(z) p 2 5s ~ 1 (0, z) = Ng(z)bg(G, z), 

_ _ (?) 
where Ng(z) = Ng(z;< mn m ) is the unlensed mean 
source counts and s is the logarithmic count slope eval¬ 
uated at TOlim, s = [dlog 10 Ng(z; < rn)/dm] m=mi>lD . 
In the regime where 2.5s <C 1, a net count depletion 
results. Accounting for the spread of Ng(z), we ex¬ 
press the population-averaged magnification bias as (6 ; ,) = 

[f™dzNg(z)bg(z)\ [J^° dz N g(z)] \ Following Umetsu 
et al. (2014), we interpret the observed number counts on a 
grid of equal-area cells ( n = 1, 2,...) as (see Appendix A.2 of 
Umetsu (2013)) 

(bg,{G n )} = Ng(G n -< m\\ m )/N^(< TOi im ) « (/x _1 (6» n )) 1-2 
_ ( 8 ) 
with s e ff = [dlog 10 Ng(< rn)/drn] m=miim the effective 
count slope defined in analogy to Equation (7). Equation (8) 
is exact for s e ff = 0 and gives a good approximation for de¬ 
pleted populations with s e g <C 0.4. 

The covariance matrix Cov[N(9 m ), N(G n )] = ( C]si)mn of 
the source counts includes the clustering and Poisson contri¬ 
butions (Hu & Kravtsov 2003) as (C^mn = ((V M ) 2 w mn + 
bmuNg^Gm)' with oj mn the cell-averaged angular correlation 
function of source galaxies (see Equation (14) of Umetsu et al. 
2015). The angular correlation length of background galaxies 
can be small (Connolly et al. 1998; van Waerbeke 2000) com¬ 
pared to the typical resolution ~ V of cluster weak lensing, 
so that the correlation between different cells can be generally 
ignored, whereas the unresolved and nonvanishing correlation 
on small angular scales accounts for increase of the variance 
of counts. We thus approximate C\; by (Umetsu et al. 2015) 

~ + A^(0 m )] 5 mn , (9) 

with ( 5N 2 (G m )) the total variance of the mth counts. To en¬ 
hance the signal-to-noise ratio (S/N), we calculate the surface 
number density n.g{6) = dN^(d)/dfl of source galaxies as 
a function of clustercentric radius, by averaging the counts 
in concentric annuli centered on the cluster. The source- 
averaged magnification bias is then expressed as ( n^O)) = 
n At (/Lt _1 (0)) 1_2 - 5s ' ,ft with rig the unlensed mean surface num¬ 
ber density. 

We measure the magnification bias signal in each annulus 
(i = 1, 2,..., IVwl) as (Umetsu et al. 2015) 

(tv,*) = __ f 'PimNg (G m ) (10) 

/mask,v^cell m 

with fl ce ii the solid angle of the cell and V r ,n = 


• 5s e ff 
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(Em the projection matrix normalized in each 

annulus as 'P*m = 1; A m i is the fraction of the area of 
the mth cell lying within the it h annular bin (0 < Ami ^ 1)? 
and /mask ,i is the mask correction factor for the ith annular 

bin, (1 /masted) -1 = Emt 1 ~ A mi, with 

f m the fraction of the mask area in the mth cell, due to bad 
pixels, saturated objects, foreground and cluster galaxies. The 
intrinsic clustering plus statistical Poisson contributions to the 
uncertainty in (n Mj j) are given as 


(VE + ) = n _ f - 

1,1 /mask,* 


) 2 n 2 

1 “cell 


vL (c N ) r 


(ii) 

where (crjE) 2 ar, d (°'^ tat ) 2 account for the contributions from 
the first and second terms of Equation (9), respectively. 

In the present work, we use magnification measurements 
from flux-limited samples of red background galaxies ob¬ 
tained by Umetsu et al. (2014). The analysis procedure used 
in Umetsu et al. (2014) is summarized as follows: The mag¬ 
nification bias analysis was limited to the 24' x 24' region 
centered on the cluster. The clustering error term <j™\ was 
estimated empirically from the variance in each annulus due 
to variations of the counts along the azimuthal direction. For 
the estimation of (n^j), a positive tail of > vcr cells with 
v = 2.5 was excluded in each annulus by iterative a clip¬ 
ping to reduce the bias due to intrinsic angular clustering of 
source galaxies. The Poisson error term cr^'f was estimated 
from the clipped mean counts. The systematic change be¬ 
tween the mean counts estimated with and without cr clipping 

was then taken as a systematic error, tr® y ® = n'^) — n'^ > \/v, 

where n'Jj and ri'V * represent the clipped and unclipped 
mean counts in the ith annulus, respectively. As noted by 
Umetsu et al. (2014), the cr^ ys term is sensitive to large-scale 
variations of source counts and can in principle account for 
projection effects from background clusters along the line of 
sight and spurious excess counts due perhaps to spatial varia¬ 
tion of the photometric zero point and/or to residual flat-field 
errors. These errors were combined in quadrature as 


l,i = (c 


fi,i) 


AM 


+ (cr;. 


( 12 ) 


Since we include the cr® ys term in the error analysis, our mag¬ 
nification bias measurements are stable and insensitive to the 
particular choice of v. The smaller the v value, the larger the 
resulting total errors as dominated by the Poisson and system¬ 
atic terms. 

The count normalization and slope parameters (n^,s e s) 
were estimated from the source counts in the outskirts at 
10' < 6 < 0 max , with 0 max = 16' except 6» max = 14' for RX 
J2248.7—4431 observed with ESO/WFI (Umetsu et al. 2014). 
The mask-corrected magnification bias profile (n ;j ,;)/n M is 
thus proportional to (1 - / mask ,back)/(l - /mask,*) = 1 + 

In i ask ,? with /mask,back = /mask(10' < 9 < 0 max ) the 

masked area fraction in the reference background region. 
Masking of observed sky was accounted for using the method 
outlined in Umetsu et al. (2011b, Method B of Appendix 
A). This method can be fully automated to achieve similar 
performance to conservative approaches (e.g., Method A of 
Umetsu et al. 2011b) in terms of the masked area fraction 
once the configuration parameters of SExtractor (Bertin & 
Arnouts 1996) are optimally tuned. We tuned the SExtractor 
configuration parameters by setting DETECT_THRESH = 5 and 


DETECT_MINAREA = 300 as found by Umetsu et al. (2011b) 
to detect foreground objects, cluster galaxies, and defects 
(e.g., saturated stars and stellar trails) in the coadded images 
(0.2" pixel -1 sampling). We marked the pixels that belong 
to these objects in the CHECKIMAGE_TYPE = OBJECT mode. 
Recently, Chiu et al. (2016) adopted this method to calculate 
the masked area fraction for their magnification bias measure¬ 
ments, finding that the SExtractor configuration of Umetsu 
et al. (201 lb) is optimal for their observations with Megacam 
on the Magellan Clay telescope. 

We find that the unmasked area fraction 1 — / mas k 
is on average 93% of the sky at 9 > 10', decreas¬ 
ing toward the cluster center down to 88% at 1' 9 /$ 2' 

(210 kpc /i _1 R /$ 420 kpc h~ l at z = 0.35). The typical 
variation of the mask correction factor across the radial range 
is thus « max(/ mask ) - /mask,back ~ 5%, which is much 
smaller than the typical depletion signal Sn ll /n ll ~ —0.3 in 
the innermost radial bin [0.9', 1.2']. Hence, the uncertainty in 
the mask correction is of the second order. Furthermore, the 
net effect of the mask correction depends on the difference 

of the /nia.sk Values, A/ mask ,* ~ /mask,* /mask,back and is 

insensitive to the particular choice of the SExtractor configu¬ 
ration parameters. Accordingly, the systematic uncertainty on 
the mask correction is negligible. 

The magnification-bias profiles used in this study are pre¬ 
sented in Figure 2 of Umetsu et al. (2014). For all clus¬ 
ters, the estimated values and errors for n^, and s e ff 

are summarized in Table 4 of Umetsu et al. (2014). The ob¬ 
served values of s e fi range from 0.11 to 0.20, with a mean of 
(s e ff) = 0.153 and a typical uncertainty of 33% per clus¬ 
ter field. We marginalize over the calibration parameters 
((/3) m , s e ff) for each cluster in our joint likelihood anal¬ 
ysis of multiple lensing probes (Section 3.6.2). 

3.5. Strong Lensing 

Detailed strong-lens modeling using many sets of multiple 
images with known redshifts allows us to determine the criti¬ 
cal curves with great accuracy, which then provides accurate 
estimates of the projected mass enclosed within the critical 
area A c of an effective Einstein radius 0Ein = yjA c /n (Zitrin 
et al. 2015). Iri The enclosed projected mass profile 

M 2D (< 9) = SeA 2 [ K(8')d 2 0’ 

J\0'\<e (13) 

= 7t(A6') 2 Sc,ooKoo(< #)■ 

at the location 6 around is less sensitive to modeling as¬ 
sumptions and approaches (see Coe et al. 2010; Umetsu et al. 
2012; Oguri et al. 2012), serving as a fundamental observable 
quantity in the strong-lensing regime (Coe et al. 2010). 

In this work, we use the recent HST results from a joint 
analysis of CLASH strong- and weak-lensing data presented 
by Zitrin et al. (2015), who have obtained detailed mass mod¬ 
els for each cluster core using two distinct parameterizations 
(when applicable): One assumes light-traces-mass for both 
galaxies and DM, while the other adopts an analytical ellip¬ 
tical NFW form for the DM-halo components. Zitrin et al. 
(2015) performed a detailed comparison of the two models to 
obtain a realistic, empirical assessment of the true underly¬ 
ing errors, finding that the projected mass enclosed within the 

15 For an axisymmetric lens, the average mass density within this critical 
area is equal to S c , thus enabling us to directly estimate the enclosed pro- 
jected mass by M 2 d(< 0 Ein ) = *r(D 1 0 Ein ) 2 S c . 
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critical curves (z s = 2) agrees typically within ~ 15%. Zitrin 
et al. (2015) recommended replacing the statistical errors of 
their models with the actual (and much larger) uncertainties 
that account for model-dependent systematics. 

We combine for each cluster the constraints on M 2 d(< &) 
from two distinct mass models (where available) of Zitrin 
et al. (2015). To do this, we take a conservative approach that 
accounts for model-dependent systematic uncertainties. First, 
at each projected clustercentric distance 9, we improve our 
estimation of M 2 d(< 9) using the average of two different 
methods as M 2 d = {Mzi + Mz 2 )/2. Next, we combine the 
ler confidence intervals of the two models, by taking the max¬ 
imum range spanned by their respective confidence limits as 
an overall total uncertainty, <tm = [max(Mzi + erzi, Mz 2 + 
CZ 2 ) ^ min(Mzi — crzi, Mz 2 — 0z 2 )]/2. Finally, we rescale 
the error profile a m {9) such that the fractional uncertainty on 
M 2 f>{< $Ein) is 15%, as motivated by the findings of Zitrin 
et al. (2015). For all clusters, this has resulted in increased 
errors. 

Now, the question is how to determine an effective sampling 
radius A 9 of projected mass constraints M 2 d(< 9), avoid¬ 
ing oversampling and reducing correlations between adjacent 
bins. Following Coe et al. (2010), we estimate here the effec¬ 
tive resolution A 9 based on the surface density of observed 
multiple images as Ai m (A0) 2 = 7 r0| in with Aj m the number 
of multiple images. For our cluster sample, we find a median 
of N- lm = 17 multiple images per cluster and a median ef¬ 
fective Einstein radius of 0\r ]u = 22.3" for a fiducial source 
redshift of z s = 2 (Table 1), yielding A 9 ss 10". This is con¬ 
sistent with the typical map resolutions in the strong-lensing 
regime adopted by the SaWLens analysis of Merten et al. 
(2015, see their Table 5). For the X-ray-selected subsample, 
we find a median value of 0Ein = 20.1" at z s = 2. The max¬ 
imum integration radius is taken to be 40", which is equal to 
approximately twice the median Einstein radius (z s = 2), the 
region where multiple images form (see Section 3 of Zitrin 
et al. 2012b). 

To summarize, for each cluster except RXJ 1532.9+3021 
for which no secure identification of multiple images has been 
made (Zitrin et al. 2015), we obtain enclosed projected mass 
constraints {M 2Dyi }^ = {M 2D (< 0j)}iSi for a set of 
TVsl = 4 fixed integration radii 0* = 10", 20", 30", and 40" 
from the //5Tlensing analysis of Zitrin et al. (2015). A sum¬ 
mary of the 11ST leasing mass estimates is given in Table 1. 


3.6.2. Joint Likelihood Function 


In the Bayesian framework of Umetsu (2013, see also 
Umetsu et al. (2011b)), the lensing signal is described by 
a vector s of parameters containing the binned convergence 
profile {Koo,i}fli with N = Ag l + Awl* given by A binned 
k values and the average convergence enclosed by the inner¬ 
most aperture radius 9 m i n for strong-lensing mass estimates, 
K'oo.min = ^oo( + ^niin)^ SO that 


S = {t 


'oo,min? n '<x> 


— y-1 y 


(15) 


specified by (A + 1) parameters. We have TV = 14 for 
all clusters except A = A W l = 10 for RXJ1532.9+3021 
(Section 3.6.1). The number of degrees of freedom (dof) is 
TV G l — (TV + 1) = A W l — 1 = 9 for all clusters. 

The joint likelihood function £gl(s) for multi-probe lens¬ 
ing observations is given as a product of their separate likeli¬ 
hoods, 

£gl = £slAvl = £sl 116 ) 

with £sl, and the likelihood functions for 

{M 2 D,J 2 i\ {(+m)}S l > and {(n M)i )}^ L , respectively, 
defined as 


Nsc 


1 M [M 2 D,i — M 2 D,i{s)] 2 




ln£ s = 


ln£ M = 


1 [(g+.j) — g+,t( s ; c )Y 

2 hi a +,i 

1 y^ L [(n Mji ) - (s, c)] 2 

' 2 ^ (J 2 

i= 1 AM 


(17) 


with (M 2 o, g + , hfj) the theoretical predictions for the cor¬ 
responding observations (Appendix A) and c the calibration 
nuisance parameters to marginalize over, 

c = {{W) g ,fw,g,{ w )n^>s e s}. (18) 

For each parameter of the model s, we consider a flat un¬ 
informative prior with a lower bound of s = 0. Additionally, 
we account for the calibration uncertainty in the observational 
parameters c. 


3.6.3. Estimators and Covariance Matrix 


3.6. Cluster Mass Profile Reconstruction 
3.6.1. Lensing Constraints 

We consider multiple complementary lensing information 
available in the cluster regime, namely enclosed projected 
mass estimates from strong lensing, source-averaged tangen¬ 
tial distortion and magnification bias measurements: 

{M 2D>i }^f, {(<?+,i}& (<«/+)& (14) 

Hence, there are a total of 7V G l = TVsl + 2 Awl independent 
lensing constraints for each cluster. 

Umetsu et al. (2014) measured the shear and magnifica¬ 
tion effects in Awl = 10 log-spaced clustercentric radial 
bins over the range [0.9', 16'] for all clusters, except [0.9', 14'] 
for RX J2248.7—4431 observed with ESO/WFI (Section 3.4). 
We have Asl = 4 projected mass estimates (Table 1) from 
the HST lensing analysis of Zitrin et al. (2015) for all clusters 
except RXJ1532.9+3021 without strong-lensing constraints. 


We implement our method using a Markov Chain Monte 
Carlo algorithm with Metropolis-Hastings sampling follow¬ 
ing the prescription outlined in Umetsu et al. (2011b). The 
method has been tested (Umetsu 2013) with synthetic weak- 
lensing catalogs from simulations of analytical NFW lenses 
performed using the public package GLAFIC (Oguri 2010). 
The results suggest that, when the mass-sheet degeneracy is 
broken, both maximum-likelihood (ML) and marginal maxi¬ 
mum a posteriori probability (MMAP) solutions provide re¬ 
liable reconstructions with unbiased profile measurements. 
Hence, this method is not sensitive to the choice and form of 
priors. In the presence of a systematic bias in the background- 
density constraint ( n M ), the global ML estimator is less sen¬ 
sitive to systematic effects than MMAP, and provides more 
accurate reconstructions (Umetsu et al. 2014). 

16 If no strong-lensing constraint is available (iVgL = 0), re^mm rep¬ 
resents the average convergence within the inner radial boundary of weak- 
lensing observations, 0 m ; n = 0.9' (Umetsu et al. 2014). 
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On the basis of our simulations, we thus use the global ML 
estimator for determination of the mass profile. In our error 
analysis we take into account statistical, systematic, cosmic- 
noise, and intrinsic-variance contributions to the total covari¬ 
ance matrix C i: j = Cov(st, sfi) for = E/E CjOC as 

c , = c ,st a t + c ,sys +c ,l ss+c ,int ) (19) 

where G' st,at is the posterior covariance matrix that is derived 
from the data (Equations (16) and (17)) by calculating the 
sample covariance matrix Cfj at = ((st — (si))(sj — (Sj ))) 
using MCMC-sampled posterior distributions; C ays accounts 
for systematic errors due primarily to the mass-sheet uncer¬ 
tainty (Umetsu et al. 2014), 

( C sys )ij = (s M l — smmap)^!j, (20) 

with sml and smmap the ML and MMAP solutions, respec¬ 
tively; C laa is due to uncorrelated large-scale structure (LSS) 
projected along the line of sight (Hoekstra 2003; Umetsu et al. 
2011a); C lnt accounts for the intrinsic variations of the pro¬ 
jected cluster mass profile (Gruen et al. 2015). 

The cosmic-noise covariance due to projected uncorrelated 
LSS is given as (Umetsu et al. 2011a) 

(c lss h = (2i) 

where is the weak-lensing cross power spectrum as a 
function of angular multipole l evaluated for a given pair of 
source populations in the ith and jth radial bins; Jo(l9i) is the 
Bessel function of the first kind and order zero (Jo) averaged 
over the ith annulus between 0i t i and 9i^{> 0i,i), given as 
(Umetsu et al. 201 la) 

Jo(lQi) = 777 — 75 — 777—77 \l 9 i^Ji{l 9 i^) — l 9 i,iJi(l 9 i.i)\ ■ 
(Wi,2) — \l 9 i, 1 ) 

( 22 ) 

We compute the elements of the C laa matrix for a given pair 
of source populations, using the nonlinear matter power spec¬ 
trum of Smith et al. (2003) for the Wilkinson Microwave 
Anisotropy Probe ( WMAP ) seven-year cosmology (Komatsu 
et al. 2011), and then scale to the reference far-background 
source plane. For the inner radial bins constrained by HST 
strong lensing, we assume a source plane at z s = 2, which 
is a typical redshift of strongly lensed background galaxies 
(Zitrin et al. 2015). For the outer weak-lensing bins, we use 
for each cluster the effective mean source redshift (5 e ff) esti¬ 
mated with multi-band photometric redshifts (see Table 3 of 
Umetsu et al. 2014). The cross covariance terms between the 
strong- and weak-lensing bins are computed using the lens¬ 
ing window functions of the respective populations (Takada 
& White 2004). For a given depth of observations, the impact 
of cosmic noise is most important where the cluster signal it¬ 
self is small (Hoekstra 2003). 

The intrinsic covariance matrix C' ut accounts for the in¬ 
trinsic variations of the projected cluster density profile (i.e., 
cluster signal itself) due to the c-M variance, halo aspheric- 
ity, and the presence of correlated halos (Gruen et al. 2015). 17 
In the present work, we consider the diagonal form of the C lnt 

17 Omen et al. (2015) formally include the contribution from the projected 
uncorrelated LSS in C mt . In this study, we separately account for this exter¬ 
nal contribution as C lss , to be consistent with the procedure in Umetsu et al. 
(2014). 


matrix, 

(C m %- = af nt x (sml)^ij, (23) 

where the coefficient amt ~ \J Cj/V^oo.i represents the frac¬ 
tional intrinsic scatter in k. On the basis of semi-analytical 
calculations calibrated by cosmological numerical simula¬ 
tions (Gruen et al. 2015), we find that, for CLASH clusters 
with a characteristic mass of M 200 C ~ 10 15 Mq /i _ 1 (Umetsu 
et al. 2014), the diagonal part of the C mt matrix can be well 
approximated by Equation (23) with o: illt ss 0.2 in the one- 
halo regime at R fiz r 2 oom- In general, the diagonal approxi¬ 
mation to C mt can lead to an underestimate of cluster parame¬ 
ters (see Gruen et al. 2015), where the degree of underestima¬ 
tion depends on the radial binning scheme for cluster lensing 
measurements. 

We have tested the validity of this approximation in our ra¬ 
dial binning scheme by comparing against the semi-analytical 
model of Gruen et al. (2015). For a cluster halo of A^ooc = 
10 15 M e h- 1 at zi = 0.35 (M 200 m ^ 13 x 1O 14 M 0 /i" 1 ) 
matching approximately the average characteristics of the 
CLASH sample (Umetsu et al. 2014), we find that the total 
S/N estimated using the diagonal approximation with amt = 
0.2 is accurate to about 10% in the regime of our cluster lens¬ 
ing observations (S/N ~ 10 per cluster), where the S/N is 
defined as (Umetsu & Broadhurst 2008) 18 

(S/N) 2 = £ s,G,;.s, = s t C~ 1 s, (24) 

with C the total covariance matrix defined in Equation (19). 
In the present study, we thus adopt amt = 0.2 in Equation 
(23) to account for the effects of the intrinsic profile variations 
in projection space. 19 

4. CLASH INDIVIDUAL CLUSTER ANALYSIS 

In this section we carry out a comprehensive analysis of 
strong-lensing, weak-lensing shear and magnification data 
sets for all 20 CLASH clusters in our sample. 

4.1. CLASH Mass Profile Reconstruction 

Following the methodology outlined in Section 3, we ana¬ 
lyze our weak- and strong-lensing data presented in Umetsu 
et al. (2014) and Zitrin et al. (2015) to examine the underlying 
radial mass distribution for a sample of 20 CLASH clusters 
(Table 1). We have derived for each cluster a mass-profile so¬ 
lution E = E CjOC s = {E m m, Ej}^ from a joint likelihood 
analysis of our strong-lensing, weak-lensing shear and mag¬ 
nification data. We find that the minimum x ' 2 ( = — 21ii£gl) 
values for the best-fit E solutions range from \ 2 = 2.5 (Abell 
611) to 14.8 (MACSJ0429.6—0253) for 9 dof (a mean re¬ 
duced x 2 °f 0.95), indicating good consistency between in¬ 
dependent observations having different systematics. 

In Appendix B (Figure 11) we show the resulting mass- 
profile solutions, S (black squares), obtained for our sam¬ 
ple along with those by Umetsu et al. (2014, blue circles) 

18 We note that this classical S/N definition beaks down in the noise- 
dominated regime. In our analysis, the binning scheme was chosen such that 
the pel-pixel detection S/N is ^ 1 for each cluster (Umetsu et al. 2014), and 
thus the noise contribution to this estimator is negligibly small. 

15 Strictly speaking, when simultaneously determining the mass and con¬ 
centration for a given individual cluster, the contribution from the intrinsic 
c—M variance should be excluded from C lnt . To simplify the procedure, 
however, we shall fix amt = 0.2 throughout this study. We note that the 
effect of the c—M variance becomes important only at small cluster radii, 
e < 2' (Gruen et al. 2015). 
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Clustercentric radius 9 [arcmin] 

Figure 1. Contributions to the total covariance matrix C = C stat + C sys + 
C lss + C lnt (black line with squares) for the binned cluster convergence 
profile k(9) (see Figure 11) reconstructed from a joint likelihood analysis of 
strong-lensing, weak-lensing shear and magnification data sets. The results 
are obtained by averaging over 18 clusters observed with both HST and Sub¬ 
aru (i.e., all except RXJ1532.9+3021 without strong-lensing constraints and 
RXJ2248.7—4431 based on ESO/WFI data). The diagonal variance terms 
Var(+), scaled to the mean depth of Subaru weak-lensing observations, are 
shown as a function of clustercentric radius 9. 




Figure 2. Cross-correlation coefficients of the ensemble-averaged C stat 
(left panel) and C lss (right panel) matrices, whose diagonal elements are 
shown in Figure 1. 

and Merten et al. (2015, red dots). Umetsu et al. (2014) 
derived weak-lensing-only solutions for this sample from a 
joint likelihood analysis of the shear and magnification mea¬ 
surements. When the inner strong-lensing information (Table 
1) is combined with wide-held weak-lensing data, the cen¬ 
tral weak-lensing bin E(< 0.9') (Section 3.6.1) is resolved 
into (IVsl + 1) radial bins, hence improving the determina¬ 
tion of the inner mass profile. Merten et al. (2015) performed 
a two-dimensional SaWLens analysis of 19 CLASH X-ray- 
selected clusters, by combining the CLASH HST data with 
the shear catalogs of Umetsu et al. (2014). To break the mass- 
sheet degeneracy, Merten et al. (2015) assumed outer bound¬ 
ary conditions such that the average convergence vanishes at 
the edge of the reconstruction. This is a reasonable approxi¬ 
mation in wide-held weak-lensing observations entailing the 
full cluster held well beyond its virial radius, r v - 1T . 

We hnd overall good agreement between different recon¬ 
structions, except for a few systems in the overlap sam¬ 
ple, such as MACSJ1931.8-2635, RXJ1347.5-1145, and 


MACSJ0744.9+3927. For these clusters, the SaWLens 
reconstructions are systematically lower than those of this 
work and Umetsu et al. (2014) which include the weak- 
lensing magnihcation data. Here MACS J1931.8—2635 (b = 
-20.09°) and MACSJ0744.9+3927 (b = +26.65°) are the 
two lowest Galactic latitude clusters of the overlap sample, 
implying higher stellar densities, correspondingly large ar¬ 
eas masked by bright saturated stars, and hence lower num¬ 
ber densities of background galaxies usable for weak lensing 
(Umetsu et al. 2014, see their Figure 2, Tables 3 and 4). In 
fact, MACSJ 1931.8—2635 has the lowest S/N of the weak- 
lensing observations presented in Umetsu et al. (2014, Table 
5). On the other hand, RXJ1347.5-1145 (z = 0.451) and 
MACSJ0744.9+3927 (z = 0.686) represent the two highest- 
z clusters of the overlap sample, both of which exhibit com¬ 
plex mass distributions with a high degree of substructure (see 
Postman et al. 2012; Merten et al. 2015). For clusters at lower 
Galactic latitudes and higher redshifts, the color selection of 
background galaxies is correspondingly more difficult. There¬ 
fore, this discrepancy can be attributed in part to systematic 
uncertainties in the present calibration of magnification mea¬ 
surements for these low Galactic latitude clusters and high 
redshift clusters. Weak-lensing mass reconstructions are sen¬ 
sitive to the treatment of boundary conditions if there are mas¬ 
sive structures near the data boundaries. Hence, mass profile 
reconstructions for clusters with high degrees of substructure 
can be subject to a greater degree of mass-sheet degeneracy. 
A more quantitative comparison with the SaWLens results 
from Merten et al. (2015) can be found in Sections 7.3 and 
7.4. 

Figure 1 shows the average contributions to the total 
covariance matrix C (Equation (19)) for the convergence 
profile n{9) reconstructed from the joint likelihood anal¬ 
ysis of strong-lensing, weak-lensing shear and magnifica¬ 
tion data. The results are obtained by averaging over 18 
clusters observed with both HST and Subaru (i.e., all ex¬ 
cept RXJ1532.9+3021 without strong-lensing constraints and 
RXJ2248.7—4431 based on ESO/WFI data). The diagonal 
variance terms Var(re), scaled to the mean depth of Subaru 
weak-lensing observations, are shown as a function of clus¬ 
tercentric radius 9. At all bins except the innermost bin, the 
reconstruction uncertainty is dominated by the observational 
statistical errors (C stat , red dashed). The relative contribu¬ 
tion from intrinsic variance (C lnt , gray solid) increases to¬ 
ward the cluster center and becomes important at small clus¬ 
ter radii, 9 £ 2'. The C sys term (green dotted) represents 
the level of residual variance (Equation (20)) due primar¬ 
ily to the mass-sheet degeneracy. In the regime of Subaru 
weak-lensing (9 > 0.9'), C sys stays approximately constant 
at ~ (2 — 4) x 10~ 4 with 9, corresponding to a characteristic 
mass-sheet uncertainty of cr K ~ (1 — 2) x 10 -2 per cluster. 
A noticeable increase of the cosmic-noise contribution C lss 
(blue dotted-dashed) from projected uncorrelated LSS is seen 
at 9 < 40", within which the reconstruction is dominated by 
HST strong-lensing measurements with greater depth. 

In Figure 2, we show the cross-correlation coefficients of 
the ensemble-averaged 6 ,stat and C lss matrices, whose di¬ 
agonal elements are shown in Figure 1. We find that ad¬ 
jacent HST bins of the (C ,stat ) matrix are anti-correlated at 
around the 40% level, where the first 4 bins in each panel of 
Figure 2 correspond to the HST data. This negative covari¬ 
ance arises because they are to satisfy the observed cumula¬ 
tive mass constraints (Equation (13)). We find small corre- 
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CLASH: Strong-lensing, Weak-lensing Shear and Magnification Analysis 


Table 2 

NFW halo parameters for individual CLASH clusters 


Cluster 

AT 200 C 

C200c 

r -2 


(1O 14 M 0 /r- 1 ) 


(Mpc h~l) 


X-ray Selected: 


Abell 383 

7.98 

± 

2.66 

5.9 

± 

1.8 

0.31 

± 

0.13 

Abell 209 

15.40 

± 

3.42 

2.7 

± 

0.6 

0.84 

± 

0.22 

Abell 2261 

23.10 

± 

5.22 

3.7 

± 

0.9 

0.69 

± 

0.20 

RXJ2129.7+0005 

6.14 

± 

1.79 

5.6 

± 

1.6 

0.30 

± 

0.11 

Abell 611 

15.76 

± 

4.49 

3.9 

± 

1.2 

0.57 

± 

0.21 

MS2137-2353 

13.56 

± 

5.27 

2.7 

± 

1.3 

0.80 

± 

0.45 

RXJ2248.7-4431 

18.78 

± 

6.72 

3.6 

± 

1.4 

0.66 

± 

0.32 

MACSJ1115.9+0129 

16.66 

± 

3.85 

3.0 

± 

0.8 

0.75 

± 

0.23 

MACSJ1931.8-2635 

15.28 

± 

7.13 

4.4 

± 

1.9 

0.51 

± 

0.30 

RXJ1532.9+3021 

5.98 

± 

2.32 

5.2 

± 

2.8 

0.29 

± 

0.18 

MACSJ1720.3+3536 

14.50 

± 

4.30 

4.1 

± 

1.3 

0.51 

± 

0.20 

MACSJ0429.6-0253 

9.76 

± 

3.50 

4.6 

± 

1.7 

0.40 

± 

0.19 

MACSJ 1206.2-0847 

18.17 

± 

4.23 

3.7 

± 

1.1 

0.60 

± 

0.21 

MACSJ0329.7-0211 

8.65 

± 

1.97 

6.7 

± 

1.6 

0.26 

± 

0.08 

RXJ1347.5-1145 

34.25 

± 

8.78 

3.2 

± 

0.9 

0.85 

± 

0.29 

MACSJ0744.9+3927 

18.03 

± 

4.96 

3.5 

± 

1.2 

0.58 

± 

0.23 


High Magnification: 

M ACS J0416.1 -2403 

10.74 ± 2.60 

2.9 ±0.7 

0.65 ±0.18 

MACSJl 149.5+2223 

25.02 ± 5.53 

2.1 ± 0.6 

1.12 ±0.35 

MACSJ0717.5+3745 

26.77 ± 5.36 

1.8 ±0.4 

1.31 ± 0.31 

MACSJ0647.7+7015 

13.90 ± 4.20 

4.1 ± 1.5 

0.48 ±0.21 


Note. — Cluster parameters derived from single spherical NFW fits to individ¬ 
ual surface mass density profiles (Figure 11) reconstructed from combined strong- 
lensing, weak-lensing shear and magnification measurements. We adopt a concor¬ 
dance cosmology of h = 0.7, = 0.27, and Qa = 0.73. The fitting radial 

range is restricted to R < 2 Mpc h~ 1 ~ 2.9 Mpc . 

lations of < 1% between the HST (10" < 9 < 40") and 
Subaru (0.9' < 9 < 16') radial bins. For the (C lss ) matrix, 
since strongly lensed source galaxies at z s ~ 2 and weakly 
lensed source galaxies at z s ~ 1 behind a given cluster share 
the same mass overdensities at z ^ 1 where the geometrical 
lensing efficiency for the z s ~ 2 sources is large, there are 
large positive correlations at the ~ 70% — 20% levels between 
the HST and Subaru bins, where the degree of correlation in¬ 
creases with decreasing projected separation. 

4.2. Weighing CLASH Clusters 

Tangential shear fitting with a spherical NFW profile is a 
standard approach for measuring individual cluster masses 
from weak lensing (Umetsu et al. 2009; Okabe et al. 2010; 
Applegate et al. 2014). Numerical simulations suggest that 
these mass estimates tend to be biased low (by ~ 5%-10%; 
see Meneghetti et al. 2010; Becker & Kravtsov 2011; Rasia 
et al. 2012) because local substructures that are abundant in 
cluster outskirts dilute the shear tangential to the cluster cen¬ 
ter. This bias can be reduced if the fitting range is restricted to 
within ~ 2r5ooc ~ r v i r (Becker & Kravtsov 2011). In the 
CLASH survey, the availability of multi-probe, multi-scale 
lensing data allows us to combine weak shear lensing with 
magnification and/or strong-lensing constraints (Umetsu et al. 
2014; Merten et al. 2015). In particular, the complementary 
combination of shear and magnification data enables to ef¬ 
fectively break the mass-sheet degeneracy and reconstruct the 
total mass distribution £ (Umetsu et al. 201 lb). 

Here we revisit the mass estimates for CLASH clusters us¬ 
ing our improved mass profile data set (Figure 11) and tak¬ 
ing into account the intrinsic contribution (C mt ) to the error 
covariance matrix (Section 3.6.3). We follow the Bayesian 
approach of Umetsu et al. (2014) to make inference on the 
NFW halo parameters. We employ the radial dependence 
of the projected NFW profiles given by Wright & Brain- 


erd (2000), which provides a good description of the pro¬ 
jected mass distribution in the one-halo regime, at least in 
an ensemble-average sense (Oguri & Hamana 2011; Umetsu 
et al. 2011a; Okabe et al. 2013). We restrict the fitting range 
to R < 2 Mpc h~ l (Umetsu et al. 2014; Merten et al. 2015), 
which is close to the virial radius for most CLASH clusters. 
We specify the two-parameter NFW model using the halo 
mass M 200 C and the halo concentration C 200 C = T 200 c/t -2 
with r _2 the characteristic radius at which the logarithmic 
density slope is -2 (Section 5.2.1). We adopt uninforma¬ 
tive log-uniform priors in the respective intervals, 0.1 < 
M 200 c /(10 15 M o hr 1 ) < 10 and 0.1 < C 200 C < 10 (Umetsu 
et al. 2014). We check that the mass and concentration esti¬ 
mates for the CLASH sample are not sensitive to the choice of 
the priors as found by Sereno et al. (2015b, see their Section 
2.1). The x 2 function for our observations is 

X 2 (p) = J2 [st - SiO)] C- 1 [sj - Sj{p )], (25) 

i>3 

where p = (M 20 oc, c 2 ooc), and Si(p) = £i(p)/£ Ci00 is the 

predicted surface mass density averaged over the Zth annu¬ 
lus, accounting for the effect of bin averaging (Umetsu et al. 
2014). 

In Table 2, we give marginalized constraints on the NFW 
halo parameters (M 2000 c 2 ooc) and the characteristic radius 
r_ 2 . Throughout this paper, we employ the biweight estima¬ 
tors of Beers et al. (1990) for the central location (Obi) and 
scale (Sbi) of the marginalized posterior distributions (Sereno 
& Umetsu 2011; Umetsu et al. 2014, 2015). From the poste¬ 
rior samples, we also derive marginalized constraints on the 
total enclosed mass Ma = M 3 d(< r a) at several character¬ 
istic interior overdensities A (see Section 1). Table 3 sum¬ 
marizes the results of our cluster mass estimates. The me¬ 
dian precision on our lensing mass measurements is found to 
be ~ 28% at A c = 200, - 24% at A c = 500, ~ 23% at 
A c = 1000, and ~ 24% at A c = 2500. 

In the CLASH weak-lensing study of Umetsu et al. (2014), 
the C lnt contribution (Equation 23) to the total covariance 
matrix (Equation (19)) was not included in their individual 
cluster mas measurements at A c < 500. Accordingly, their 
individual mass measurement errors were underestimated by 
~ 20% at A c = 200, on average (see also Figure 5 of Gruen 
et al. 2015), even though C' lnt is less important at larger clus¬ 
ter radii (Figure 1). 

On the other hand, the stacked cluster measurements of 
Umetsu et al. (2014) empirically account for the cluster-to- 
cluster profile variations from bootstrap resampling of the 
cluster sample. 

5. CLASH STACKED MASS PROFILE ANALYSIS 

Stacking an ensemble of clusters helps average out the pro¬ 
jection effects of halo asphericity and substructure, as well as 
the cosmic noise from projected uncorrelated LSS. The statis¬ 
tical precision can be improved by stacking a large number of 
clusters, allowing a tighter comparison of the averaged lens¬ 
ing signal with theoretical models (Okabe et al. 2010, 2013; 
Umetsu et al. 201 la, 2014; Miyatake et al. 2015). 

Hereafter, our analysis will focus on the X-ray-selected 
subsample of 16 CLASH clusters, which comprises a popula¬ 
tion of high-mass X-ray regular clusters. The X-ray selection 
of this subsample is optimized for radial profile measurements 
(Postman et al. 2012). Numerical simulations suggest that the 
CLASH X-ray-selected subsample is prevalently composed 
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Table 3 

Mass estimates for individual CLASH clusters 


Cluster 

M 2500 C 

Tfioooc 

T?500c 

M vir “ 

Miooc 

M200m 

M(< 1.5Mpc) 


(10 14 M 0 ) 

(10 14 M o ) 

(1O 14 M 0 ) 

(10 14 M o ) 

(10 14 M 0 ) 

(1O 14 M 0 ) 

(10 14 M o ) 


X-ray Selected: 


Abell 383 

2.78 

± 

0.63 

4.43 

± 

1.16 

5.88 

± 

1.73 

9.41 

± 

3.33 

9.66 

± 

3.45 

10.34 

± 

3.78 

6.95 

± 

1.63 

Abell 209 

2.95 

± 

0.68 

6.18 

± 

1.25 

9.64 

± 

1.97 

19.60 

± 

4.61 

20.49 

± 

4.88 

22.36 

± 

5.44 

10.28 

± 

1.37 

Abell 2261 

5.91 

± 

1.03 

10.85 

± 

1.89 

15.65 

± 

3.05 

28.21 

± 

6.87 

29.39 

± 

7.26 

31.41 

± 

7.94 

14.22 

± 

1.79 

RXJ2129.7+0005 

2.08 

± 

0.44 

3.35 

± 

0.78 

4.48 

± 

1.16 

7.21 

± 

2.23 

7.48 

± 

2.34 

7.87 

± 

2.51 

5.75 

± 

1.23 

Abell 611 

4.13 

± 

0.92 

7.48 

± 

1.67 

10.73 

± 

2.65 

18.95 

± 

5.77 

19.99 

± 

6.21 

20.78 

± 

6.54 

11.24 

± 

1.95 

MS2137-2353 

2.47 

± 

0.72 

5.21 

± 

1.41 

8.28 

± 

2.57 

16.98 

± 

7.31 

18.25 

± 

8.12 

18.91 

± 

8.54 

9.67 

± 

2.12 

RXJ2248.7-4431 

4.45 

± 

1.05 

8.42 

± 

2.08 

12.45 

± 

3.62 

22.54 

± 

8.78 

24.12 

± 

9.68 

24.53 

± 

9.91 

12.67 

± 

2.50 

MACSJ1115.9+0129 

3.50 

± 

0.82 

7.02 

± 

1.43 

10.67 

± 

2.22 

20.30 

± 

4.97 

21.88 

± 

5.48 

22.24 

± 

5.60 

11.55 

± 

1.61 

MACSJ1931.8-2635 

4.10 

± 

1.12 

7.36 

± 

2.36 

10.51 

± 

4.05 

18.02 

± 

9.05 

19.18 

± 

9.88 

19.44 ± 10.07 

11.23 

± 

3.07 

RXJ1532.9+3021 

1.83 

± 

1.01 

3.03 

± 

1.38 

4.17 

± 

1.71 

7.04 

± 

2.79 

7.51 

± 

3.02 

7.58 

± 

3.06 

5.81 

± 

1.63 

MACSJ1720.3+3536 

3.92 

± 

0.86 

7.00 

± 

1.59 

9.96 

± 

2.53 

17.04 

± 

5.39 

18.29 

± 

5.94 

18.30 

± 

5.94 

11.02 

± 

2.05 

MACS10429.6-0253 

2.85 

± 

0.70 

4.92 

± 

1.32 

6.85 

± 

2.10 

11.35 

± 

4.33 

12.15 

± 

4.76 

12.13 

± 

4.74 

8.40 

± 

2.03 

MACSJ1206.2-0847 

4.62 

± 

1.01 

8.47 

± 

1.63 

12.24 

± 

2.49 

21.35 

± 

5.29 

23.18 

± 

5.94 

22.82 

± 

5.81 

12.98 

± 

1.84 

MACSJ0329.7-0211 

3.29 

± 

0.64 

5.02 

± 

1.00 

6.51 

± 

1.37 

9.72 

± 

2.30 

10.35 

± 

2.50 

10.21 

± 

2.45 

7.94 

± 

1.36 

RXJ 1347.5-1145 

7.65 

± 

1.63 

14.91 

± 

2.98 

22.33 

± 

4.89 

40.66 ± 11.13 

44.50 ± 12.60 

43.59 ± 12.25 

19.33 

± 

2.61 

MACSJ0744.9+3927 

4.32 

± 

1.02 

8.12 

± 

1.76 

11.94 

± 

2.81 

20.57 

± 

5.98 

23.21 

± 

7.08 

21.32 

± 

6.29 

13.96 

± 

2.39 


High Magnification: 
MACSJ0416.1-2403 

2.21 ± 0.53 

4.48 ± 0.97 

6.85 ± 1.52 

12.99 ±3.29 

14.13 ±3.66 

14.12 ± 3.66 

8.74 ± 1.37 

MACSJl 149.5+2223 

3.73 ± 1.11 

8.74 ± 1.98 

14.57 ±3.06 

30.45 ± 7.06 

34.72 ± 8.36 

32.62 ± 7.71 

15.31 ± 1.95 

MACSJ0717.5+3745 

3.42 ±0.87 

8.61 ± 1.74 

14.98 ±2.85 

32.97 ± 6.88 

37.95 ±8.17 

35.46 ± 7.52 

15.63 ± 1.75 

MACSJ0647.7+7015 

3.72 ±0.98 

6.65 ± 1.65 

9.49 ±2.51 

15.91 ± 5.07 

17.59 ± 5.83 

16.64 ±5.40 

11.37 ±2.26 


Note. — Cluster mass estimates M 3 d(< r) from single spherical NFW fits to individual surface mass density profiles (Figure 11). All quantities in the table are 
given in physical units assuming a concordance cosmology of h = 0.7, = 0.27, and = 0.73. See Table 2 for Af^ooc- The fitting radial range is restricted to 

R < 2 Mpc h -1 ~ 2.9Mpch“ 0 1 . 

a Virial overdensity A v i r based on the spherical collapse model (see Appendix A of Kitayama & Suto (1996)). For our redshift range 0.187 < z < 0.686, A v i r ranges 
approximately from ~ 110 to 140 with respect to the critical density of the universe at the cluster redshift. 


of relaxed clusters (~ 70%) and largely free of orientation 
bias (Meneghetti et al. 2014). 20 The selection criteria based 
on X-ray morphology also ensure well-defined cluster centers 
(Section 3.1). The four high-magnification clusters are thus 
excluded from this part of the analysis (Sections 5 and 6 ). 


5.1. Stacked Cluster Leasing Signal 

With a given set of surface mass density profiles for in¬ 
dividual clusters, we can stack them together to produce an 
ensemble-averaged radial profile. Following the prescription 
of Umetsu et al. (2011a), we re-evaluate the X profiles of in¬ 
dividual clusters in physical (proper) length units, using the 
same radial grid for all clusters. Stacking an ensemble of clus¬ 
ters (n = 1, 2,...) is expressed as (Umetsu et al. 201 la) 

= (26) 

where W n is the sensitivity matrix of the nth cluster, 

= (C-%, (27) 

with E( c ,oo),n the far-background critical surface mass density 
and C n the total covariance matrix (Equation (19)) for the nth 
cluster . 21 The error covariance matrix for the stacked ((X)) 
profile is given by (Umetsu et al. 2011a) 


C= (Z) W "J ■ f28) 

20 Meneghetti et al. (2014) showed that, for this subsample, the median 
angle between the major axis of the halos and the X-ray-selected sight lines 
is ~ 57°, compared to ~ 54° expected for a distribution of random orienta¬ 
tions. 

21 Since the covariance matrix C is defined for the far-background con¬ 
vergence Koo, the associated critical surface mass density too is a far- 
background quantity, S Ci00 = E c (z -4 oo). 


The weight matrix W n is mass-independent when stacking in 
physical length units (Okabe et al. 2010, 2013; Umetsu et al. 
2011a, 2014), which makes the effective halo mass extracted 
from the averaged lensing signal a good proxy for the mean 
population value (see Section 5.4; Johnston et al. 2007; Okabe 
et al. 2013; Umetsu et al. 2014; Sereno et al. 2015c). On the 
other hand, stacking in length units scaled to ta weights the 
contribution of each cluster to each radial bin in a nonlinear 
and model-dependent manner (Okabe et al. 2013), such that 
tr(yV) ex ex M^ 3 when C is dominated by the statistical 
noise contribution 6 ' stat . 

In Figure 3 we show the resulting ((X)) profile averaged in 
13 radial bins. The innermost bin represents the mean density 
interior to f? m i n = 40kpc/i _1 , which corresponds approx¬ 
imately to the typical resolution limit of our strong-lensing 
data (0 m i n = 10"; Section 3.6.2 and Figure 11). Since R m j n 
is much larger than the rms offset between the BCG and X- 
ray peak, a Q g ~ llkpc/r -1 (Section 3.1), the miscentering 
effects on the ((£)) profile are expected to be insignificant 
for the X-ray-selected subsample (Umetsu et al. 2014). The 
other bins are logarithmically spaced over the range R = 
[R™ in. Rmax] = [40, 4000] kpC ft -1 (0.02 < -R/r200m ~ 2), 
spanning two decades in radius. For this sample, we find a 
sensitivity-weighted average redshift of (( 24 )) ~ 0.34, in close 
agreement with the median redshift of ~Z\ ~ 0.35. We detect 
the stacked lensing signal at a total S/N of ~ 33 using the total 
covariance matrix C including the statistical, systematic, pro¬ 
jected uncorrelated LSS, and intrinsic-variance contributions 
(Section 3.6.3). 

5.2. Modeling the Stacked Lensing Signal 

We quantify and characterize the ensemble-averaged mass 
distribution of our X-ray-selected subsample using the ((E)) 
profile (Section 5.1). To interpret the observed averaged lens¬ 
ing signal, we consider the line of sight projected surface mass 
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Figure 3. Upper panel: ensemble-averaged surface mass density ((£)) (black squares) of the X-ray-selected subsample of 16 clusters, which is obtained by 
stacking individual S profiles (gray lines; Figure 11) derived from the joint analysis of HST and Subaru lensing data sets. The red-shaded area shows the la- 
confidence region of the three-parameter halo model fit (NFW+LSS (ii). Table 4). The projected NFW model (cyan-shaded area, la) slightly underpredicts the 
total mass profile relative to the halo model at R ^ 7 * 200 m • The scale on the right vertical axis indicates the corresponding lensing convergence scaled to the 
mean depth of weak-lensing observations. Lower panel: The logarithmic slope dln((E))/dln R (black squares) is shown along with the two best-fit models in 
the upper panel. 


density around the cluster center, £(f?) = Jdl Ap(i'), with 
A p(r) = p{r)—p m the mass overdensity. In the regime where 
R ;$ r 2 oom, S(i?) is dominated by the cluster halo contribu¬ 
tion ph(r), so that £(i?) ~ 2 f£°dl Ph(r)- 
As shown in Figure 3, our weak- and strong-lensing data 
together cover a wide range of clustercentric distances R, ex¬ 
tending out to i?max = 4000 kpc /i _1 ss 2r2oom- In the con¬ 
text of the ACDM model, the two outermost radial bins lie 
in the transition between the one-halo and two-halo regimes 
(Cooray & Sheth 2002), r 2 oom ~ R ~ 2i? 2 oom, where the 
large-scale two-halo contribution to £(f?) is expected to be¬ 
come important (Oguri & Hamana 2011; Beraldo e Silva et al. 
2013; Umetsu et al. 2014). 22 In this work, we thus test models 
both with and without including the two-halo term. 


sity slope is isothermal, that is, dlnph(r)/dlnr = —2 at 
r = r_ 2 - In analogy to the NFW concentration parameter, the 
degree of concentration is defined by C 2 oo c = r 200 c/f- 2 - We 
use M 200 C and C 200 C as fitting parameters, when possible. 

1. Generalized NFW (gNFW) model (Zhao 1996): 


(r/r s )7c(l+ r /r s )3-Tc’ (29 ) 

T- 2 = (2 - 7c )r s , 

with 7 C the central slope, p s and r s the characteris¬ 
tic density and radius, respectively. For 7 C = 1, this 
reduces to the standard NFW model Pnfw(t) with 
r s = r-2- 


5.2.1. Halo Density Profiles 

We give here a brief description of the halo profile mod¬ 
els that we consider. For each model Ph{r), the halo mass is 
defined using a spherical overdensity A c = 200 as M 2 qo c . 
We introduce the radius r _2 at which the logarithmic den- 

22 On the other hand, the tangential shear 7 - 1 - = AE/E C is insensitive to 
the projected two-halo term in the transition regime (Oguri & Hamana 2011). 


2. Einasto model (Einasto 1965): 


p h (r) = p- 2 ex p 



(30) 


with ctE the shape parameter describing the degree of 
curvature and p _2 = ph{r- 2 )- An Einasto profile with 
cie ~ 0.18 closely resembles the NFW profile over 
roughly two decades in radius (Ludlow et al. 2013). 
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6. Power-law sphere model: 

Ph(r)cxr _7c . (34) 

This model includes the singular isothermal sphere 
model with 7 C = 2. 

The NFW, gNFW, and Einasto density profiles represent 
a family of phenomenological models for cuspy DM halos 
motivated by numerical simulations and observations. The 
DARKexp model describes the distribution of particle ener¬ 
gies in finite, self-gravitating, collisionless, isotropic systems, 
providing theoretical predictions for the structure of colli¬ 
sionless DM halos. For radii accessible to A'-body simula¬ 
tions, DARKexp allows for central slope values in the range 
—2 1$ (1 In pt jdhir < 0. The empirical PI and Burkert mod¬ 
els describe cored density profiles. The power-law model is 
often adopted as a lens model for its simplicity (e.g., Koop- 
mans et al. 2009; Agnello et al. 2013). 


Figure 4. Upper panel: Comparison of models to the ensemble-averaged 
surface mass density ((E)) (black squares) obtained for the X-ray-selected 
subsample of 16 clusters. Models with PTE > 0.05 are shown with solid 
lines, while those with PTE < 0.05 are shown with dashed lines (see Ta¬ 
ble 4). The red solid curve shows the best-fit two-parameter halo model 
(NFW+LSS (i). Table 4), including the effects of surrounding LSS as a two- 
halo term assuming the halo bias function of Tinker et al. (2010) in the WMAP 
seven-year cosmology. The lower panel shows the deviations A (in units of 
cr) of the best-fit profiles with respect to the observed ((E)) profile. 


3. DARKexp -7 model. DARKexp is a theoretically de¬ 
rived model for collisionless self-gravitating systems 
with isotropic velocity distributions (Hjorth & Williams 
2010; Williams & Hjorth 2010). We use Dehnen- 
Tremaine 7 -models (Dehnen 1993; Tremaine et al. 
1994) as an analytic fitting function for the DARKexp 
density profile (Hjorth et al. 2015): 

Ph ^ (r/r s )'>' c (l + r/r s ) 4-7c ’ 

r -2 = (1 - 7c/2)r s , *- 31 ) 

7 C « 3log 10 (f>o — 0.65 (1.7 < (j) 0 < 6), 

where p s and r s are the scale density and radius, respec¬ 
tively, and cj )0 represents the dimensionless depth of the 
halo potential describing the profile shape. The 7 mod¬ 
els approximate DARKexp very well over nearly four 
decades in radius (Hjorth et al. 20 15). 23 

4. Pseudo-isothermal (PI) sphere model: 

Ph(r) = Pc (32) 

1 + (r/r c ) 2 

with p c and r c the core density and radius, respectively. 
The corresponding asymptotically flat circular velocity 
is 14 = ( 47 rGp c rc ) 1,/2 (Shao et al. 2013). 

5. Burkert model (Mori & Burkert 2000): 

PhO) = n , w. ,2/2' ( 33 ) 

(1 + r/r 0 )(l + r 2 /rg) 

with p^ and tq the core density and radius, respectively. 


23 The DARKexp density profile is also well approximated by an Einasto 
profile at small halo radii (Hjorth et al. 2015). 


5.2.2. Projected Halo Model 

To interpret the averaged lensing signal in the context of 
the standard ACDM model, we employ as our reference 
model the halo model prescription of Oguri & Takada (201 1). 
Specifically, we take into account the large-scale clustering 
contribution P 2 \i(j') as 


Ap(r) = / t (r) p b (r) + P2h(r), 



(35) 


where / t (r) describes the steepening of the density profile 
around a truncation radius 77 (Baltz et al. 2009; Diemer & 
Kravtsov 2014). We fix the truncation parameter T 200 C = 
rt/r 2 ooc = 3 (Covone et al. 2014; Umetsu et al. 2014; 
Sereno et al. 2015c), a typical value for cluster-sized halos 
in the ACDM cosmology (Oguri & Hamana 2011). When 
the two-halo term is neglected, the standard NFW halo model 
(Oguri & Hamana 2011) reduces to the Baltz-Marshall- 
Oguri (BMO) model that describes a truncated NFW profile, 
ft{r)pNFw(r) (Baltz et al. 2009). 

For an ensemble of clusters with mass M and redshift z, 
the two-halo term is expressed as P 2 h(f) = p m 6 h(A / /)^(r) 
with ~p m the mean background density of the universe, b b (M) 
the linear halo bias, and (r) the linear matter correlation 
function, all evaluated at z = ((zi)) ~ 0.34 in the WMAP 
seven-year cosmology (Section 3.6.3). The two-halo term is 
proportional to the product / 7 a 2 , where er 8 is the rms am¬ 
plitude of linear mass fluctuations in a sphere of comoving 
radius 8 Mpc ft -1 . In the adopted cosmology, 07 = 0.81 (Ko¬ 
matsu et al. 201 1). We compute the total surface mass density 
£(i?) by projecting A p(r) = ftPih + P 2 h along the line of 
sight (Section 2.2 of Oguri & Hamana 201 1). To evaluate the 
halo bias factor b b (M), we adopt the model of Tinker et al. 
(2010), which is well calibrated using a large set of A'-body 
simulations. We use their fitting formula with a halo mass 
definition of A c = 200 (Umetsu et al. 2014), corresponding 
to A m ~ 420 in our adopted cosmology. 


5.3. Model Comparison with Observations 

We constrain model parameters using the ((£)) profile and 
its total covariance matrix C (Section 5.1). The 7 2 minimiza¬ 
tion is performed using the MINUIT minimization package 
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Table 4 

Best-fit models for the stacked mass profile of the CLASH X-Ray-selected subsample 


Model 

M 200 C 

C200c 

Shape/Structural Parameters 

K 

X 2 / dof 

PTE“ 

Notes 


(10 14 m 0 h~ 2 ) 







NFW 

!4-4±i;S 

q 7fi+0.29 
<u -0.27 

7c = 1 

— 

11.3/11 

0.419 

No truncation 

gNFW 

14-iiLi 

4 04”*~°' 53 
^• U ^-0.52 

7c = 0.85/°;” 

— 

10.9/10 

0.366 

No truncation 

Einasto 

14.7± 4 i 

q cq+0.36 
°°°-0.39 

a E = 0.232/0° 42 

— 

11.7/10 

0.306 

No truncation 

DARKexp-7 b 

14-5 /);) 

q cq+0-42 

o.oo—o 42 

<fo = 3.90/°;)) 

— 

13.5/10 

0.198 

No truncation 

Pseudo isothermal 

— 

— 

V c = 1762/)°kms- 1 ,r c = 69//kpc 

— 

23.6/11 

0.015 

No truncation 

Burkert 

H-6±o.l 

— 

r 200c/r 0 = 8.81/°')) 

— 

29.9/11 

0.002 

No truncation 

Power-law sphere 

12 5 +0 - 8 
iZ , o - 0. 8 

— 

fyj _ 1 70+0.02 

7c — 1. /8 q 02 

— 

93.5/11 

0.000 

No truncation 

Halo Model c : 








NFW+LSS (i) 

i4-i±l:8 

q 7Q+0.30 
<y -0.28 

7c = 1 

9.3 

10.9/11 

0.450 

ACDM (M) scaling 

NFW+LSS (ii) 

!4-4±i;| 

3 74+ 0 - 33 
' -0.30 

7c = 1 

7 4+4-6 
'•^—4.7 

10.8/10 

0.377 

as a free parameter 

Einasto+LSS (i) 

14.3//) 

3 69 - * -0 ' 36 
°- uy —0.42 

^ _ r» O/IS+0.051 

“E — 0.248_ 0 047 

9.3 

10.7/10 

0.385 

ACDM 6^ (M) scaling 

Einasto+LSS (ii) 

14.5/1;* 

3 65”*”°' 47 
O.OO—o 61 

^ _ n O4C+0.061 

a E — U.z45_o Q53 

0 7+6.3 
‘ -5.6 

10.6/9 

0.301 

as a free parameter 

DARKexp+LSS (i) 

14.2/)'/ 

3 64 +0 - 44 
°'°^-0.46 

<t> 0 = 3.89/°')) 

9.3 

11.7/10 

0.308 

ACDM bh(M) scaling 

DARKexp+LSS (ii) 

i4.o/);® 

3 69”*”°' 53 
J,uy —0.57 

<t>0 = 3.85/°;®) 

10.1/);) 

11.6/9 

0.235 

as a free parameter 


a Probability to exceed the observed % 2 value. 

b We use Dehnen-Tremaine 7 -models with the central cusp slope 7 C = 3 log 10 0o — 0.65 (1.7 < <fio < 6 ) as an analytic fitting function for the DARKexp density profile. 
c For halo model predictions, we decompose the total mass overdensity A p(r) = p(r) — p m as A p — ftPh + P2h where ph (r) is the halo density profile, p2h ( r ) — Pm^h^m ( r ) 
is the two-halo term, and /t(r) = (l + r 2 /r 2 ) -2 describes the steepening of the density profile in the transition regime around the truncation radius rt, which is assumed to be 
n = 3r 2 ooc- 


from the CERN program libraries. The best-fit parameters 
are reported in Table 4, along with the reduced x' 2 and corre¬ 
sponding probability to exceed (PTE, hereafter) values. 

We first consider halo density profile models without in¬ 
cluding the two-halo term. In this modeling approach, no 
truncation is applied when calculating the surface mass den¬ 
sity £(./?), effectively accounting for the large-scale contri¬ 
bution. Figure 4 shows the best-fit model profiles along 
with the observed ((E)) profile. We find that two- or three- 
parameter cuspy models, namely, the NFW, gNFW, Einasto, 
and DARKexp -7 models, provide satisfactory fits (PTE > 
0.05) to the data. The best-fit gNFW model has a central 
cusp slope of 7 C = 0.85^0 31 > being consistent with the NFW 
model ( 7 C = 1). The cored PI and Burkert models provide 
poor fits with ,\ 2 /dof (PTE) values of 23.6/11 (1.5 x 10 -2 ) 
and 29.9/11 (1.7 x 10~ 3 ), respectively. The power-law sphere 
model has a reduced \ 2 value of y 2 /dof = 93.5/11 and is 
strongly disfavored by the ((E)) profile having a pronounced 
radial curvature. 

Now we examine projected halo models following the pre¬ 
scription described in Section 5.2.2. In what follows, we will 
focus on our best models, specifically, the NFW, Einasto, and 
DARKexp -7 cuspy density profiles. We note that since the 
halo bias is given as a function of M 2000 the number of free 
parameters is unchanged for each case. We find these mod¬ 
els (NFW+LSS (i), Einasto+LSS (i), and DARKexp+LSS (i) 
in Table 4) give statistically comparable fits. In all cases, in¬ 
cluding the two-halo term improves the fits, while keeping 
the best-fit parameters essentially unchanged (the difference 
in each parameter is much less than the ler uncertainty). In¬ 
dependent of the chosen profile, we find 7 (A^oiic) ~ 9.3 
(6hc| ~ 6.1) for our best-fit models. 

On the basis of the goodness-of-fit statistic, the highest- 
ranked model among those considered is the NFW halo model 
(NFW+LSS (i); see the red curve in Figure 4) with M 2 ooc = 
14.l/)j) x 1O 14 M 0 and C 200 C = 3.79/])'))!, followed by 
the Einasto and DARKexp halo models (Einasto+LSS (i) and 
DARKexp+LSS (i)). The best-fit NFW halo model yields 


an Einstein radius of 0Ein = 14. 0/3 \ arcsec at z s = 2, 
which is about 1.7er lower than the median effective Ein¬ 
stein radius (see Section 3.5) of (?Ein = 20.1" observed 
for the X-ray-selected subsample (Section 3.5). The best-fit 
NFW parameters of our projected halo model are in good 
agreement with those from the stacked shear-only analysis of 
Umetsu et al. (2014), M 200c = 13.4/;);° x 10 14 M Q li " 1 and 
C 200 C = 4.0llg;32, in which the two-halo term can be safely 
neglected as the stacked tangential-shear signal, ((AE)), is 
only sensitive to the intra-halo mass distribution in our radial 
range. The Einasto shape parameter is constrained to be «e = 
0.248/q'q|) from an Einasto halo-model fit to the ((E)) pro¬ 
file (Einasto+LSS (i)), in agreement with ctE = 0.19l/°'Qgg 
from the stacked shear-only analysis of Umetsu et al. (2014). 
Our measurements agree well with predictions from ACDM 
numerical simulations, «e = 0.21 ± 0.07 (Meneghetti et al. 
2014, «e = 0.24 ± 0.09 when fitted to surface mass den¬ 
sity profiles). The fitting formula given by Gao et al. (2008) 
yields cke — 0.29 for our X-ray-selected subsample. This is 
consistent with our results at the la level. 

The halo-model interpretation we have adopted is a crude 
approximation in the transition between the intra-halo and 
two-halo regimes, neglecting nonlinear effects (Baldauf et al. 
2010; Diemer & Kravtsov 2014). In order to effectively ac¬ 
count for possible modifications due to nonlinear effects, here 
we allow the halo bias factor b\ t to be a free parameter. We 
find that allowing 6 h to be free does not improve the fits and 
that the resulting best-fit parameters remain unchanged within 
the errors (Table 4). The effective halo bias bh is detected at 
1 . 6 cr- 2 .Ocr significance, where the exact level of significance 
depends on the details of the halo density profile in the transi¬ 
tion regime. In Figure 3, we compare the resulting NFW halo 
model profile (NFW+LSS (ii), red shaded area) and the NFW 
profile (cyan-shaded area) along with the ((E)) profile. 


5.4. Interpreting the Effective Halo Mass 
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FigureS. Cumulative mass profiles M/± = A/ 31 ) (< r A ) for the X- 
ray-selected subsample of 16 clusters, shown at several characteristic val¬ 
ues of the spherical mass overdensity A c = Ph(< r )/pc- The gray lines 
show the A/a, profiles for individual clusters obtained from NFW fits to 
their S profiles (Tables 2 and 3; Figure 1 1). The blue line and the cyan- 
shaded area show the mean and its lcr error of the composite-halo profile 
{(M/\)) from a sensitivity-weighted average of the individual M/a profiles. 
The filled square at A c = 200 marks the best-fit halo mass Moooc = 
14.1 ~ j' q x 10 14 Mq hyg obtained from a halo-model fit (NFW+LSS (i) 
in Table 4) to the stacked ((E)) profile (Figure 4). This is in excellent agree¬ 
ment with the mean sample mass ((A/aoOc)} = (14.3+1.1) X i 0 14 A/q hyg 
averaged using tr(VV) (Equation (27)) as weights. 


Interpreting the effective mass from stacked lensing re¬ 
quires caution when the cluster sample spans a broad range 
of masses and redshifts (Mandelbaum et al. 2005; Niikura 
et al. 2015). If there is a significant scatter in the luminosity- 
mass relation of halos, then the halo mass distribution of 
a luminosity-selected sample becomes significantly broader, 
with the width and asymmetry of the distribution typically in¬ 
creasing for higher luminosity bins. Accordingly, there is no 
typical mass, and in general the effective mass determination 
from NFW fits falls between the mean and the median masses 
of the halo population (Mandelbaum et al. 2005). Alterna¬ 
tively, if a halo sample has a broad redshift distribution, the ef¬ 
fective lensing mass determined from stacking with redshift- 
dependent weights (Equation (27)) is generally different from 
the mean population mass (Umetsu et al. 2014). 

To assess this possibility, we compare the stacked lensing 
results with the individual cluster masses (Tables 2 and 3). 
For the former, we consider our best model (NFW+LSS (i) in 
Table 4) for the X-ray-selected subsample, namely the NFW 
halo model using the b^-M relation of Tinker et al. (2010). 

Following Umetsu et al. (2014), we construct a composite- 
halo mass profile ((M/a)) (see also Coupon et al. 2013; Ford 
et al. 2014) from a sensitivity-weighted average of NFW fits 
to individual cluster S profiles as 


«Ma» 


E„tr(W n )M A , n 

En MW„) 


(36) 


using tr(W n ) oc (Equation (27)) as an ef¬ 

fective sensitivity weight for each cluster. Hence, the 
weight depends on cluster redshift z\ through X( CjOC )- 


The error variance for ((M/a)) is given by = 

En tr ( W «)] -2 E n a A ,n[ tr ( w «)] 2 with cr A ,„ the la uncer¬ 
tainty for M/a,u- It was shown by Umetsu et al. (2014) that 
this composite-halo approach using the trace approximation 
for the sensitivity matrix can give an adequate description of 
the observed stacked lensing signal. 

The results of this comparison are summarized in Figure 
5. At A c = 200, we find a sensitivity-weighted average 
of ((M 200c » = (14.3 ± 1.1) x 1O 14 M 0 for the X-ray- 
selected subsample, in excellent agreement with the best-fit 
halo mass of M 2 oo c = 14.ll 4 g x 1O 14 M 0 extracted 
from a halo-model fit to the ((E)) profile. On the other hand, 
the unweighted median masses of the clusters are ~ 3%-8% 
higher than the sensitivity-weighted masses ((M/a)) at each 
overdensity. This difference is not due to the asymmetry of 
the distribution of the cluster masses because the unweighted 
median and mean masses agree to within ~ 2% at each over¬ 
density. Our results appear to be robust against different 
ensemble-averaging techniques once the effects of sensitiv¬ 
ity weighting are consistently taken into account, supporting 
the findings of Umetsu et al. (2014). 


6. CLASH CONCENTRATION-MASS RELATION 
6.1. Bayesian Linear Regression 

We examine the relationship between halo mass and con¬ 
centration for our CLASH X-ray-selected subsample. Here 
we closely follow the Bayesian regression approach of Sereno 
et al. (2015b), taking into account the correlation between the 
errors on mass and concentration (e.g., Du & Fan 2014) and 
the effects of nonuniformity of the mass probability distribu¬ 
tion (e.g., Kelly 2007; Andreon & Berge 2012). We consider 
a power-law function of the following form: 


C200C = 10“ 


/ M 2 ooc A 

V ^piv ) 


l + z 
1 + £piv 


7 


(37) 


with Mpi v and z p i v the pivot mass and redshift, respectively. 
We set Mpiv = 10 15 M 0 /i _1 and z p - lv = 0.34, approxi¬ 
mately the effective mean values of the cluster mass and red¬ 
shift, respectively (Section 5). In the present analysis, we fix 
7 = —0.668 (Meneghetti et al. 2014) as predicted for the 
CLASH X-ray-selected population (for details, see Section 
6.2) because our data do not have sufficient statistics to con¬ 
strain the redshift evolution of the c-M relation (Merten et al. 
2015). 

To perform regression analysis we define new dependent 
and independent variables in logarithmic form as 


4" = log 10 


l + z 

1 + 2 piv 


-7 

C 200 c , 


X = log 10 (M 2 ooc/Mpi v ). 


(38) 


Equation (37) is then expressed by a linear relation, Y = 
a + /3X. The observed values of latent variables are denoted 
with lowercase letters. If the selected clusters in our sample 
are not uniformly distributed in logarithmic mass X, this can 
lead to biased estimates of regression parameters (Kelly 2007; 
Sereno et al. 2015b). We properly account for these effects in 
Bayesian regression (Kelly 2007; Sereno et al. 2015a; Sereno 
& Ettori 2015a; Sereno et al. 2015b). 

The clusters in our X-ray-selected subsample are selected 
to be X-ray hot (> 5keV) and regular in X-ray morphol- 
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ogy (Postman et al. 2012). In general, if we select a clus¬ 
ter sample by imposing certain thresholds on cluster observ¬ 
ables, the steepening at the high end of the intrinsic mass func¬ 
tion, combined with the selection effects, makes the result¬ 
ing mass probability distribution of selected clusters approxi¬ 
mately lognormal (Appendix A of Sereno & Ettori 2015a). 

In this study, we model the intrinsic probability distribution 
V{X) of logarithmic mass X with a single Gaussian func¬ 
tion characterized by two parameters, namely the mean // and 
the dispersion r (Kelly 2007). This approach alleviates the 
problem of assuming a uniform prior distribution on the in¬ 
dependent variable X and, in general, provides a good ap¬ 
proximation for a regular unimodal distribution (Kelly 2007; 
Andreon & Berge 2012; Sereno & Ettori 2015a). A uniform 
prior distribution can be recovered in the limit of r —> oo. 

The conditional probability distribution V(y\x) = 
Y[ n V(y n \x n ) of y = {y n } given x = {z„} is then written 
as (Kelly 2007) 


In (2^)+ ^" ^ Xn) ^ , 

(39) 

where (y n \x n ) and <r 2 = Var(y n |a: T1 ) are the conditional 
mean and variance of y n given respectively: 


\nV(y\x) = 

n 


{Vn\Xn) 


i o i 3” Cxy,n / 

a + Pn H- 2 r [Xn 

I o xx,n 


p), 


a n ~ & 2t2 + a Y\X + ~ 


(/?T 2 + C X y,n ) 2 
T 2 “t“ Cxx.n 


(40) 


where (Jy\x is the intrinsic scatter in the Y—X relation; 
C X x,m C yy ,n, and C xy n = C yx , n are the elements of the 
covariance matrix between the observables x n and y„ of the 
nth cluster. If V{X n ) is assumed to be uniform, we have 
{yn\Xn) = tr T f3x n and cr n = 5“ ^yy,n T ft G xx>n 

28C xy , n (e.g„ Kelly 2007; Okabe & Smith 2015). 

We use uninformative priors for all parameters. We adopt 
uniform priors for the intercept a, the slope (3, and the mean //. 
For the variance parameters er 2 -| v and r 2 , we use the inverse 
Gamma distribution, IT(e, e), with e -C 1 a small number 
(Andreon & Hum 2010; Sereno & Ettori 2015b, a). In our 
analysis, we choose e = 10 -3 (Sereno & Ettori 2015b, a). 

Figure 6 shows the two-dimensional marginalized poste¬ 
rior distributions derived for all pairs of the regression pa¬ 
rameters (ct, /3, <Jy\x, Mi T )- The marginalized constraints 
(Cbi ± SYsi; Section 4.2) on the intercept, slope, and intrin¬ 
sic scatter are a = 0.60 ± 0.04, f3 = —0.44 ± 0.19, and 
ay\x = 0.056 ± 0.026 (or, a natural logarithmic scatter of 
0.13 ± 0.06). The posterior distributions show a long tail ex¬ 
tending toward positive values of /?, corresponding to shal¬ 
lower slopes for the c-M relation. This tail is associated with 
small values of the r parameter that describes the width of 
the mass probability distribution V(X). Hence, accounting 
for the nonuniformity of the mass probability distribution is 
crucial for making inference of the underlying c-M relation 
for the CLASH sample. We have checked that the tail in the 
posterior distribution of j3 disappears if V{X) is assumed to 
be uniform. We see from Figure 6 that the constraint on the 
t parameter is dominated by the prior especially at r ^ 0.5 
and that the data are not informative enough to determine the 
dispersion of the intrinsic mass distribution. 



Figure 6. Constraints on the regression parameters (a, /?, <Ty\x 1 1‘ - T ) °f 
the c-M relation (Section 6.1) obtained from HST and Subaru lensing obser¬ 
vations of 16 CLASH X-ray-selected clusters, showing marginalized one¬ 
dimensional distributions and two-dimensional 68% and 95% limits. For 
each parameter, the blue solid line shows the biweight central location (Cbi) 
of the marginalized one-dimensional posterior distribution. For the r param¬ 
eter, the prior probability distribution function is shown by the red line. 

6.2. Comparison with Predictions for the CLASH Survey 

In Figure 7, we summarize in the c-M plane our regression 
results obtained for our 16 CLASH X-ray-selected clusters. A 
detailed comparison with the SaWLens results (Merten et al. 
2015) is presented in Section 7.4.2. 

Understanding the selection function and observational bi¬ 
ases arising from projection effects is crucial when inter¬ 
preting the c-M relation derived from lensing observations. 
For the CLASH sample, this has been addressed in detail by 
Meneghetti et al. (2014) using a sample of ~ 1400 cluster¬ 
sized halos (0.25 < z < 0.67) selected from the MUSIC-2 
nonradiative hydrodynamical A'-body simulations (Sembolini 
et al. 2013) in a ACDM universe (h = 0.7, f2 m = 0.27, = 

l-fi m , tr 8 = 0.82). 

Meneghetti et al. (2014) characterized a sample of ha¬ 
los that follows the CLASH selection function based on X- 
ray morphological regularity. Their results suggest that the 
CLASH X-ray-selected subsample is prevalently composed 
of relaxed clusters (~ 70%) and largely free of orientation 
bias (Section 5). Another important implication is that this 
subsample is expected to have a very small scatter in concen¬ 
tration because of the high degree of regularity in their X-ray 
morphology. 

Meneghetti et al. (2014) find that the mean two-dimensional 
concentration of the CLASH X-ray-selected sample is ex¬ 
pected to be ~ 11% higher than that for the full population 
of clusters (Meneghetti et al. 2014). According to their sim¬ 
ulations, we expect that the NFW concentrations of this sam¬ 
ple determined from noise-free X profiles are in the range 
3 ~ C 200 C ~ 6, with a mean (median) value of 3.87 (3.76) 
and a standard deviation of 0.61. The distribution follows a 
power-law relation of the form c oc M -° 160 ± 0108 x (1 + 
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Figure 7. Upper panel: concentration-mass relation for the CLASH X-ray-selected subsample of 16 clusters derived from a joint analysis of HST and Subaru 
lensing data sets. The black squares with error bars represent the measured parameters and their la uncertainties for individual clusters. The gray shaded region 
shows the lcr confidence region of the CLASH c-M relation (z = 0.34) from our Bayesian regression. The blue triangle shows the best-fit parameters from a 
halo-model fit (NFW+LSS (i) in Table 4) to the ensemble-averaged surface mass density profile, ((5])) (Figure 4). The yellow contours represent the lcr and 2cr 
confidence regions determined from the stacked shear-only analysis of the same sample (Umetsu et al. 2014). The cyan-shaded band shows the lcr uncertainty 
on the CLASH c-M relation obtained by Merten et al. (2015). The red-solid line represents the theoretical expectation from numerical simulations accounting 
for the projection effects and the CLASH selection function based on X-ray morphology (Meneghetti et al. 2014). The red-dashed and red-dotted lines show 
the intrinsic three-dimensional c-M relations for the relaxed and full populations, respectively (Meneghetti et al. 2014). The lower panel shows, for each case, 
the ratio between the measured concentration and the predicted value using the c(M, z) relation for the CLASH X-ray-selected population (red-solid line in the 
upper panel). 


z )-o. 668 ±o. 34 i w ith a normalization of c| z= o .34 = 3.96±0.14 
at 10 15 Mq ft -1 . This model, shown as the red solid line 
in Figure 7, is in excellent agreement with our regression 
results (gray shaded area) with c| 2= o .34 = 3.95 ± 0.35 at 
10 15 M 0 ft, -1 , as well as with the stacked lensing measure¬ 
ments: C 200 C = 3.79^q 28 (blue triangle; Section 5) from our 
stacked lensing analysis and C 200 C = 4.01+q 32 (yellow con¬ 
tours) from the stacked shear-only analysis of Umetsu et al. 
(2014). The derived slope of (3 = —0.44 ± 0.19 is some¬ 
what steeper than, but consistent within errors with, the pre¬ 
dicted values of /3 = — 0.160±0.108 (Meneghetti et al. 2014). 
The intrinsic scatter, ay\x = 0.056 ± 0.026, is in agree¬ 
ment with the expected standard deviation of 0.61 around 
the mean 3.87 (Meneghetti et al. 2014), corresponding to 
o ’(l°g 10 C 200 c) ~ 0.07, or er(lnc 2 ooc) ~ 0.16. This is signif¬ 
icantly smaller than the typical intrinsic scatter predicted for 
the full (relaxed) population of halos, <7(log 10 C 2 oo c ) ~ 0.14- 
0.16 (0.1-0.12) (Neto et al. 2007; Duffy et al. 2008; Bhat- 
tacharya et al. 2013). 

7. DISCUSSION 
7.1. Systematic Errors 

As described in Section 3, we have accounted for var¬ 
ious sources of errors associated with our strong-lensing, 
weak-lensing shear and magnification measurements. All 
these errors are encoded in the measurement uncertainties 
(er_t_, fr /( , om) that enter the joint likelihood analysis (Section 


3.6) and contribute to the posterior covariance matrix (7 stat 
of the mass profile solution s = S/Soo, c - In particular, our 
magnification bias analysis (Section 3.4) accounted for spuri¬ 
ous large-scale variations of the red galaxy counts (cr® ys ), as 
well as the angular clustering (crjf 1 ) and Poisson error (cr® tat ) 
contributions. The fractional area / mas k lost to cluster mem¬ 
bers, foreground objects, and defects was calculated as a func¬ 
tion of clustercentric radius and corrected for in the source 
counts according to Equation (10). In our mass profile anal¬ 
ysis, the total covariance matrix C includes additional con¬ 
tributions from the residual mass-sheet uncertainty C sys , the 
cosmic noise C ,|ss , and the intrinsic variations C lnt of the clus¬ 
ter signal due to the c-M variance, halo asphericity, and the 
presence of correlated halos (Section 3.6.3). 

Additionally, we quantified potential sources of system¬ 
atic errors in our weak-lensing shear+magnification mea¬ 
surements as follows (Section 3): (1) dilution of the weak- 
lensing signal by cluster members (2.4%, Section 3.2), (2) 
photometric-redshift bias in the mean depth estimates (/?) 
(0.27%, Section 3.2), and (3) shear calibration uncertainty 
(5%, Section 3.3). These systematic errors scale approxi¬ 
mately linearly with the cluster mass and add to 5.6% in 
quadrature. For the absolute calibration of cluster masses, 
one needs to take into account the systematic bias due to mass 
model uncertainties. Meneghetti et al. (2014) find that spher¬ 
ical NFW masses of cluster-sized halos estimated from their 
surface mass densities are biased low on average by 5% and 
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that the bias is significantly reduced for relaxed halos (1%- 
2%) because they are more spherical than unrelaxed ones. 
According to their simulations, the degree of negative bias 
expected for our sample dominated by relaxed clusters is 3% 
(Massimo Meneghetti, private communication). Hence, the 
systematic uncertainty in the absolute mass calibration is es¬ 
timated to be ~ 6%. This implies that the total uncertainty in 
the absolute mass calibration with the sample of 20 clusters is 
v/0.28 2 /20 + 0.06 2 ~ 9% at A c = 200 (see Section 4.2). 


0-1/200m /2500c /'1000c /500c /200c /200m 



Spherical radius, r[h 1 kpc] 

Figure 8 . Ratio of cluster masses M 30 (< r) from NFW fits to the S profile 
obtained in our weak+strong lensing analysis (Figure 1 1 ) and to that from the 
weak-lensing analysis by Umetsu et al. (2014). The results are shown for our 
full sample of 20 clusters (gray lines). The black line and the cyan-shaded 
area show the geometric-mean mass ratio and its lcr uncertainty, respectively. 
The dashed horizontal line marks the 1:1 relation. This comparison shows 
that, adding the inner strong-lensing information to the weak-lensing con¬ 
straints has a substantial impact on the individual cluster mass determinations 
at r L f 2500 c- When averaged over the 20 clusters, our ensemble analysis 
shows no significant evidence for a systematic bias between weak-lensing- 
only and weak+strong-lensing measurements. 

7.2. Impact of Adding HST Lensing Data 

A joint analysis of multi-scale, multiple lensing probes 
makes it possible to improve the precision of cluster mass pro¬ 
file reconstructions over a wide range of clustercentric radii 
and to self-calibrate systematics as well as observational pa¬ 
rameters that describe the intrinsic properties of source pop¬ 
ulations (Rozo & Schmidt 2010; Umetsu 2013). When the 
wide-held CLASH weak-lensing data are combined with the 
inner HST lensing constraints (Nsh = 4), the central weak- 
lensing bin £(< 0.9') of Umetsu et al. (2014) is resolved into 
5 radial bins of the surface mass density. We find a significant 
improvement of ~ 45% on average in terms of the total S/N 
(Equation (24)) from adding the HST lensing information to 
the wide-held weak-lensing data. 

The improved sensitivity and resolution at 10"-40" have 
also allowed us to determine the inner characteristic radius 
r _2 and the halo concentration C 200 C = f 2 ooc/f -2 for each 
individual cluster (Tables 2). Umetsu et al. (2014) did not 
attempt to determine C 200 C for each cluster because the weak- 
lensing prohles of individual clusters are highly degenerate 


in M 200 C an( i C 200 C, which can potentially bias the slope of 
the c-M relation determined from weak lensing (Hoekstra 
et al. 2011; Du & Fan 2014). This is particularly the case for 
high- 2 , low-mass systems, for which the characteristic prohle 
curvature around r_ 2 is unconstrained by noisy and sparse 
weak-lensing data. For such clusters, the constraints on C 200 C 
are essentially imposed by prior information. We note again 
that the 6" nr contribution to the total covariance matrix C was 
not included in the individual cluster mass measurements of 
Umetsu et al. (2014, Table 6), so that their mass measurement 
errors were underestimated with respect to this work (Section 
4.2). 

A multi-probe approach combining complementary probes 
of cluster lensing allows us to test the consistency and robust¬ 
ness of cluster mass measurements (Umetsu 2013). Now we 
compare our weak+strong lensing mass estimates derived for 
our full sample of 20 CLASH clusters with the weak-lensing 
masses of Umetsu et al. (2014) to assess the level of system¬ 
atic uncertainties in the ensemble mass calibration. 

In Figure 8, we show for each cluster the weak+strong to 
weak lensing mass ratio, M^ L+SL /M^ L , as a function of 
spherical radius r. The results are shown in the range r = 
[200, 2000] kpc L _1 (0.1 r/?' 2 oom ~ 1) where the weak- 
lensing mass measurements were obtained by Umetsu et al. 
(2014, see their Figure 4). At each cluster radius, we com¬ 
pute the unweighted average of the mass ratios for our sam¬ 
ple using geometric averaging (Donahue et al. 2014; Umetsu 
et al. 2014). 24 Figure 8 shows that the ensemble-averaged 
mass ratio (M^ L+SL /M^ L ) is consistent with unity within 
the errors at all cluster radii. In particular, the mass offset is 
within 2% at r 600 kpc /i _1 ~ 1.5r25ooc- We see a trend 
for the average ratio to decrease toward the center, reach¬ 
ing 0.95 ± 0.06 at r = 200 kpc h~ l . This indicates that the 
HST analysis (Zitrin et al. 2015) favors relatively low central 
masses and hence low halo concentrations although the differ¬ 
ence is not significant, | (M^ l+sl /M)q L ) — 1] < 5% ± 6%, 
compared to the sensitivity limit with 20 clusters. This level 
of mass offset is smaller than, but consistent with, the value 
8% ± 9% based on the shear-magnification consistency test 
of Umetsu et al. (2014). 

On the basis of these consistency checks, the residual sys¬ 
tematic uncertainty in the ensemble mass calibration is esti¬ 
mated to be of the order ~ 5%-8% in the one-halo regime, 
r = [200, 2000] kpc hr 1 . This however implies that, on an in¬ 
dividual cluster basis, there is a large scatter of ~ 20%-40% 
between different reconstruction methods that use different 
combinations of data. The level of uncertainty in the ensemble 
mass calibration (5%-8%) empirically estimated using differ¬ 
ent combinations of lensing probes is in agreement with the 
systematic uncertainty in the absolute mass calibration (6%) 
assessed in Section 7.1. 

7.3. Ensemble Calibration of Cluster Masses 

CLASH provides a sizable sample (20 clusters at 0.19 < 
2 < 0.69, 2 = 0.377) for the calibration of the high end of the 
cluster mass function. In principle, weak lensing can yield 
unbiased mass estimates (assuming sphericity) for a sample 
of clusters that is largely free of orientation bias (Meneghetti 

24 The geometric mean (X) is defined as (X) = (][ { X n ) 1 / ,v = 

exp f f T. ?_ 1 In X n ), so that (Y/X) = l/(X/Y) for the ratios of sam¬ 
ples X and Y. 
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Figure 9. Comparison of our weak+strong lensing mass measurements (Mclash) of 20 clusters to results from Merten et al. (2015, SaWLens), Applegate 
et al. (2014, WtG), Hoekstra et al. (2015, CCCP), and Okabe & Smith (2015, LoCuSS). For each comparison, we measure the mass of clusters within characteristic 
overdensity radii of the respective work assuming the spherical NFW density profile (Section 7.3). The dashed line shows the one-to-one relation. 


et al. 2014). In practice, however, lensing mass measure¬ 
ments can be subject to various (known and unknown) sys¬ 
tematic effects, as discussed in Section 7.1. In our early work 
(e.g., Broadhurst et al. 2005; Umetsu & Broadhurst 2008), 
we established that the dominant source of systematic effects 
in cluster weak lensing is the contamination of background 
galaxy samples by cluster members, which can lead to a sub¬ 
stantial underestimation of the true lensing signal. This point 
has been supported by recent observations (Okabe et al. 2013; 
Applegate et al. 2014; Hoekstra et al. 2015) through system¬ 
atic mass comparisons between different surveys that use dif¬ 
ferent approaches to measuring weak lensing. 

Here we compare our cluster mass estimates (Tables 2 and 
3) with those obtained from other cluster lensing surveys 
that overlap with our sample, namely the Weighing the Gi¬ 
ants program (WtG; Applegate et al. 2014), the Canadian 
Cluster Cosmology Project (CCCP; Hoekstra et al. 2015), 
and the LoCuSS (Okabe & Smith 2015, LoCuSS), 25 as well 

25 http://www.sr.bham.ac.uk/locuss 


as with those from the SaWLens analysis of Merten et al. 
(2015). The WtG sample is a representative X-ray lumi¬ 
nous subset of the ROSAT All-sky Survey (RASS) clusters 
at 0.15 < 2 < 0.7, with a median redshift of 2 = 0.387. The 
WtG clusters span a wide range of dynamical states, as well 
as of redshifts (von der Linden et al. 2014b). The CCCP sam¬ 
ple is a mixture of X-ray luminous clusters for which archival 
B- and i?-band observations made with the CFH12k camera 
on the 3.6 m Canada-France-Hawaii Telescope (CFHT) were 
available (Hoekstra 2007) and a temperature-selected subset 
of clusters drawn from the ASCA survey (k\>,Tx > 5 keV, 
Hoekstra et al. 2012), spanning the range 0.15 < 2 < 0.55 
(z = 0.233). The LoCuSS sample is drawn from the RASS 
catalogs at 0.15 < 2 < 0.3 (2 = 0.229) and is approx¬ 
imately X-ray luminosity limited (Okabe & Smith 2015). 
The observed X-ray temperatures of the LoCuSS clusters are 
fee Tx ~ 5keV (Martino et al. 2014, see their Figure 5). The 
LoCuSS sample is selected purely on the X-ray luminosity, 
ignoring other physical parameters and relaxation properties 
(Smith et al. 2016). 
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In all cases, the cluster masses are measured assuming 
a spherical NFW halo (Section 4.2). The WtG and Lo- 
CuSS mass measurements are based on weak-lensing obser¬ 
vations with Subaru/Suprime-Cam, whilst the CCCP survey 
uses weak-lensing data taken with CFHT. The CCCP and Lo- 
CuSS surveys used the BCG as the cluster center as done in 
our work (Section 3.1), whereas the WtG survey adopted the 
X-ray centroid as the cluster center (von der Linden et al. 
2014b). We note that, for all clusters in our sample, the 
mass measurements presented in Tables 2 and 3 are insensi¬ 
tive to the choice of the cluster center as discussed in Section 
3.1. Another key difference is that the LoCuSS and CLASH 
surveys controlled contamination of their background galaxy 
samples at the 2% level by imposing stringent color cuts 
(Section 3.2), albeit with increased shot noise, whereas the 
WtG and CCCP surveys boosted the diluted shear signal to 
correct statistically for contamination, by assuming that the 
observed number density profile of a pure background galaxy 
sample is flat. This correction, referred to as a boost factor, is 
not valid in general as it ignores the depletion or enhancement 
of the number density of background galaxies due to magnifi¬ 
cation bias (Umetsu et al. 2014; Okabe & Smith 2015; Ziparo 
et al. 2015; Chiu et al. 2016). 

In the following, we compare cluster masses between two 
studies by using the same aperture radii to avoid aperture mis¬ 
match problems. These comparisons are limited to those over¬ 
density radii (r a) where the fitting ranges typically overlap. 
The results of the comparisons are shown in Figure 9. For 
each case, we calculate the mass ratios for the overlap sample 
using the unweighted geometric mean (Section 7.2), unless 
otherwise noted. 

7.3.1. CLASH: Merten et al. (2015) 

There are 16 clusters in common with the CLASH 
SaWLens analysis of Merten et al. (2015). These are all 
CLASH X-ray-selected clusters. Merten et al. (2015) mea¬ 
sure masses by reconstructing two-dimensional convergence 
maps of individual clusters, binning the maps into E pro¬ 
files, and fitting these profiles with an NFW model within 
2Mpc hr 1 ( R /$ r v j r ), closely following the procedure sug¬ 
gested by Meneghetti et al. (2014). An important difference 
between the data used by Merten et al. (2015) and the data 
used here is the availability of azimuthally integrated magni¬ 
fication constraints (Umetsu et al. 2014). 26 

This comparison shows that, on average, the SaWLens 
masses are 7% ± 6% lower than our masses at A c = A v j r , 
200, and 500; their masses are 9% ± 11% lower than our 
masses at A c = 2500. This difference is smaller than the sys¬ 
tematic mass offset of ~ 10% ± 5% at r = 0.5 Mpc, 1 Mpc, 
and /’ 5 ooc found between the CLASH weak-lensing (Umetsu 
et al. 2014) and SaWLens (Merten et al. 2015) mass pro¬ 
files (see Donahue et al. 2014, Table 6). Hence, combining 
the central //.S'7-lensing and outer weak-lensing data has re¬ 
duced the discrepancy with respect to the SaWLens results. 
We find that this remaining discrepancy can be attributed to 
the three outliers discussed in Section 4.1, which correspond 

26 Merten et al. (2015) and Zitrin et al. (2015) use identical sets of HST 
lensing constraints (i.e., HST shear catalogs plus locations and redshifts of 
multiple images) as input for respective mass reconstructions. Merten et al. 
(2015) simultaneously combine the HST lensing constraints and ground- 
based shear catalogs of Umetsu et al. (2014). In this work, lensing constraints 
are combined a posteriori in the form of radial profiles according to the pro¬ 
cedure described in Section 3.6. 


to clusters at the lowest Galactic latitudes of the overlap sam¬ 
ple (MACSJ1931.82635, MACSJ0744.9+3927) and those at 
the highest redshifts (RXJ1347.51145, MACSJ0744.9+3927) 
which exhibit complex mass distributions with a high de¬ 
gree of substructure (Postman et al. 2012; Merten et al. 
2015). When excluding the three outliers, we find that the 
SaWLens mass estimates (MsaWLens) are statistically in 
excellent agreement with our results (Mclash) at all over¬ 
densities: (MsaWLens /MclASh) = 1-01 ± 0.07,1.00 ± 
0.07,0.99+0.07, and 0.95+0.12 at A c = A vir , 200,500, and 
2500, respectively. We note that this agreement is achieved 
in spite of using substantially different reconstmction meth¬ 
ods even though the data used are largely common to the two 
analyses. 

7.3.2. The WtG Project 

The WtG collaboration conducted weak-lensing shear 
mass measurements for 51 X-ray-selected luminous clus¬ 
ters at 0.15 < z ^ 0.7 (z = 0.387) using deep multi-color 
Subaru/Suprime-Cam and CFHT/MegaPrime optical imaging 
(von der Linden et al. 2014b; Applegate et al. 2014). Their 
cluster sample includes the majority of the CLASH clusters. 
There are 17 clusters in common between the two studies, 
both of which use Subaru data. The overlap sample includes 
14 CLASH X-ray-selected clusters and 3 high-magnification 
clusters. Applegate et al. (2014) derived cluster masses from 
NFW fits to reduced tangential shear profiles over the radial 
range 0.75-3 Mpc h(R a) 0.8rioooc) assuming a fixed con¬ 
centration parameter of C 200 C = 4 for all clusters. In contrast, 
we have measured masses from NFW fits to surface mass den¬ 
sity profiles within R < 2Mpc h~ l ~ 2.9Mpc h.T ( - , allowing 
both M 200 C and C 200 C as free parameters (see Section 4.2). 
This fitting procedure is the same as that adopted by Umetsu 
et al. (2014) and Merten et al. (2015). 

In the upper-right panel of Figure 9 we compare shear- 
only masses (MwtG) of Applegate et al. (2014) and our full- 
lensing masses (Mclash)* obtaining good agreement. We 
find that the WtG masses are 3% ± 9%, 7% ± 12%, and 
7% ± 12% higher than our masses at A c = 200, 500, and 
1000, respectively. We see a trend of increasing mass offset 
with increasing overdensity (or, decreasing overdensity radius 
/'a). However, the offsets are well within the ler errors and not 
statistically significant. 

7.3.3. The CCCP 

The CCCP conducted weak-shear lensing mass measure¬ 
ments for a sample of 52 clusters at 0.15 < z < 0.55 (z = 
0.233) on the basis of CFHT observations. We have 6 clusters 
in common with the CCCP project, including 5 CLASH X- 
ray-selected clusters (Abell 383, Abell 209, Abell 2261, Abell 
611, RXJ1347.5—1145) and one high-magnification clus¬ 
ter (MACSJ0717.5+3745). Hoekstra et al. (2015) measure 
NFW masses from CFHT reduced tangential shear measure¬ 
ments within 0.5-2 Mpc h^ 0 (f^sooc ~ R ~ L5r5oo c )> as¬ 
suming the c-M relation of Dutton & Maccio (2014) (see 
Section 7.4.1). 

We find that this mass comparison is somewhat sensitive 
to weighting schemes because of the large observational un¬ 
certainties in the CFHT-based CCCP masses (with a typical 
uncertainty of ~ 34%) and of the small number of clusters. 
Taking the error-weighted geometric mean (down-weighting 
clusters with noisy mass measurements), we find that the 
CCCP masses are on average 16%±10% and 9%±24% lower 
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than our masses at A c = 500 and 2500, respectively. With 
limited statistics, we do not find statistically significant differ¬ 
ences between the CCCP and our masses. We note that Hoek- 
stra et al. (2015) obtained an excellent agreement between the 
Subaru weak-lensing masses of Umetsu et al. (2014) and their 
masses measured from the Subaru imaging data processed by 
the CLASH collaboration. 27 Hoekstra et al. (2015) found that 
the Subaru-based CCCP masses are on average only ~ 2.4% 
lower than the CLASH weak-lensing masses of Umetsu et al. 
(2014). Since our mass calibration is highly consistent with 
that of Umetsu et al. (2014) (Section 7.2), a similar improve¬ 
ment is expected when the Subaru-based CCCP masses are 
compared to our weak+strong lensing masses. 

7.3.4. TheLoCuSS 

The LoCuSS has carried out a systematic weak-shear lens¬ 
ing analysis of a sample of 50 X-ray luminous clusters at 
0.15 < z < 0.3 (z = 0.229) based on deep two-band imag¬ 
ing with Subaru/Suprime-Cam (Okabe & Smith 2015). There 
are 5 clusters in common between the LoCuSS and our sam¬ 
ples (Abell 383, Abell 209, Abell 2261, RXJ2129.7+0005, 
Abell 611). Both studies use Subaru weak-shear lensing data. 
To reduce biases due to noisy inner profiles, Okabe & Smith 
(2015) optimize the binning scheme (i.e., the fitting range and 
the number of bins) for each individual cluster. They perform 
NFW fits to a suite of reduced tangential shear profiles that 
span inner radii in the range 50-300 kpc hr 1 , outer radii in 
the range 2000-3000 kpc h~ , and number of bins in the range 
4-8. The NFW concentration parameter is treated as a free 
parameter in their fits. 

For this comparison, we obtain excellent agreement be¬ 
tween the LoCuSS masses (Ml 0 CuSs) and our masses, even 
on an individual cluster basis (see the lower-right panel of Fig¬ 
ure 9). Accordingly, the comparison results are insensitive to 
the choice of weighting schemes. The LoCuSS masses are on 
average 0% ± 15%, 2% ± 13%, 7% ± 10%, and 16% ± 22% 
lower than our masses at A c = A v i r , 200, 500, and 2500, 
respectively. A better agreement is seen for lower overdensi¬ 
ties where the constraints are dominated by the Subaru weak- 
lensing measurements. A possible explanation for this excel¬ 
lent agreement is the fact that both surveys controlled contam¬ 
ination of the background samples at the A 2% level without 
employing a boost factor (Section 7.3) and both used inde¬ 
pendent but very similar shape measurement algorithms. In 
particular, Okabe & Smith (2015) adopted a shear calibra¬ 
tion procedure that is nearly identical to the one developed by 
Umetsu et al. (2010) and adopted by Umetsu et al. (2014) (see 
Section 3.3). Okabe & Smith (2015) obtained a shear calibra¬ 
tion bias of m ~ —3%, which is close to the value derived 
by Umetsu et al. (2010, m ~ —5%). We note that adding 
the inner strong-lensing information (Zitrin et al. 2015) to the 
weak-lensing constraints (Umetsu et al. 2014) has resulted in, 
on average, a decrease of the mass estimates, especially at 
r fz 1.5r25ooc (see Figure 8; Section 7.2). 

7.4. Cluster c—M Relation 

7.4.1. Comparison with ACDM Predictions from the Literature 

We compare our individual cluster mass and concentration 
measurements (Table 2; Figure 7) with predictions from nu¬ 
merical simulations in the literature. To statistically quantify 
the level of agreement with a given predicted c-M relation, 

27 https://archive.stsci.edu/prepds/clash/ 


we use frequentist measures of goodness of fit. Specifically, 
for a given fixed c(M, z) function, we evaluate the y 2 good¬ 
ness of fit to the null hypothesis that the sample data are de¬ 
rived from the model population. The y 2 statistic is then de¬ 
fined as 

loglO Cn ~ lQgio C ( M n, Z n ) j 2 

ttn 

where o n is the total statistical uncertainty of the nth cluster, 

tr n = ^int.O 4" @0^xx,n -l*xy.n (42) 

with (3q the mass slope of the intrinsic c-M relation under 
consideration and <Tj rlti o the intrinsic scatter in log 10 C 200 C at 
fixed mass and redshift. For all models, we fix (Ti nt) o = 
cryjx — 0.057 according to our regression results (Section 
6.1) and assume that the effective number of parameters for 
the null model is three (i.e., the intercept, mass slope, and 
redshift evolution). 

Table 5 lists for each model the values of y 2 and PTE, 
along with the weighted geometric average 2 * and the stan¬ 
dard deviation of observed-to-predicted concentration ratios 
c (obs) j c (pred) theoretical predictions from Meneghetti 

et al. (2014) are based on nonradiative simulations of DM 
and baryons, and those from the others are based on DM-only 
simulations. In all cases, halo masses and concentrations are 
defined using the overdensity A c = 200 and measured assum¬ 
ing the spherical NFW profile, either in projection (2D) or in 
three-dimensions (3D). For peak-height-based c{y) relations 
(Prada et al. 2012; Diemer & Kravtsov 2015), we assume the 
WMAP seven-year cosmology of Komatsu et al. (2011) to cal¬ 
culate the relationship between peak height and halo mass at 
each redshift. 22 

We first consider models for the full population of halos 
based on the three-dimensional characterization of the halo 
density profile. We find that recent c-M relations from Bhat- 
tacharya et al. (2013), Dutton & Maccio (2014), Meneghetti 
et al. (2014), and Diemer & Kravtsov (2015) are in satisfac¬ 
tory agreement with the data. The Meneghetti et al. (2014) 
model (PTE = 0.675), which is calibrated for a cosmology 
with a relatively high normalization (h = 0.7, f2 m = 0.27, 
cjg = 0.82; Section 6.2), yields the highest goodness of 
fit among those considered here, followed by the Dutton & 
Maccio (2014) model (PTE = 0.659) calibrated for the 
Planck cosmology (h = 0.671, Q m = 0.3175, cr 8 = 0.8344). 
Our measurements are 33%± 11% higher than the predictions 
of Duffy et al. (2008) based on the WMAP five-year cosmol¬ 
ogy (h = 0.742, fl m = 0.258, erg = 0.796). This is in line 
with the findings of Dutton & Maccio (2014), who showed 
that the c-M relation in the WMAP five-year cosmology has 
a 20% lower normalization at z = 0 than in the Planck cos¬ 
mology. On the other hand, the observed concentrations are 
27% ± 7% lower than the predictions of Prada et al. (2012). 
Their model exhibits a flattening and upturn of the c-M rela¬ 
tion in the high v regime, so that their concentrations derived 
for cluster halos are substantially higher than those of others. 
We refer the reader to Meneghetti & Rasia (2013) and Diemer 

28 Specifically, the weighted geometric average ( Y/X) is defined as 

(Y/X) = exp | In (Yn/X n )\ M n ) _1 1 with the in¬ 

verse variance weight for the nth cluster, Un 1 = fry n /X 2 + a f /Y„. 

29 We find that the average mass of the CLASH X-ray-selected subsample 
corresponds to a halo peak height of v ~ 3.8 in the adopted cosmology. 


x 2 = £ 
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Table 5 

Comparison of measured and predicted concentrations for the CLASH X-ray-selected subsample 


Author 

Sample 

3D/2D Function 3 

c (obs) y c (pred) 

x 2 PTE b 




Average 0 cr d 



Theory: 

Duffy et al. (2008) 

full 

3D 

c-M 

1.331 ± 0.108 

0.334 

22.6 

0.046 

Duffy et al. (2008) 

relaxed 

3D 

c-M 

1.165 ±0.094 

0.290 

13.6 

0.399 

Prada et al. (2012) 

full 

3D 

c-v 

0.733 ±0.065 

0.244 

24.6 

0.026 

Bhattacharya et al. (2013) 

full 

3D 

c-is 

1.169 ±0.095 

0.292 

14.1 

0.369 

Bhattacharya et al. (2013) 

relaxed 

3D 

c-is 

1.131 ± 0.092 

0.277 

12.4 

0.494 

Dutton & Maccio (2014) 

full 

3D 

c-M 

1.061 ± 0.086 

0.262 

10.4 

0.659 

Meneghetti et al. (2014) 

full 

3D 

c-M 

1.061 ± 0.089 

0.279 

10.2 

0.675 

Meneghetti et al. (2014) 

relaxed 

3D 

c-M 

0.990 ±0.083 

0.249 

9.2 

0.760 

Diemer & Kravtsov (2015) 

full (median) 

3D 

c-v 

1.021 ± 0.083 

0.330 

14.4 

0.349 

Diemer & Kravtsov (2015) 

full (mean) 

3D 

c-is 

1.060 ±0.086 

0.326 

13.8 

0.391 

Meneghetti et al. (2014) 

full 

2D 

c-M 

1.087 ± 0.092 

0.336 

13.5 

0.413 

Meneghetti et al. (2014) 

relaxed 

2D 

c-M 

1.040 ±0.086 

0.283 

10.8 

0.628 

Meneghetti et al. (2014) 

CLASH 

2D 

c-M 

0.988 ± 0.078 

0.227 

9.6 

0.730 

Observations: 

Merten et al. (2015) 

CLASH 

2D 

c-M 

1.133 ±0.087 

0.209 

9.2 

0.754 


a c-M : power-law c(M, z) relation; c—v : halo concentration given as a function of peak height i '(IV/, z). 
b Probability to exceed the measured y 2 value assuming the standard \ 2 probability distribution function. 
c Weighted geometric average of observed-to-predicted concentration ratios. 

^ Standard deviation of the distribution of observed-to-predicted concentration ratios. 


& Kravtsov (2015) for detailed discussions on the possible 
origin of this discrepancy. 

Next, we consider models derived for relaxed populations 
of cluster halos (Duffy et al. 2008; Bhattacharya et al. 2013; 
Meneghetti et al. 2014). Numerical simulations suggest that 
relaxed subsamples have concentrations that are on average ~ 
10% higher than for the full samples. In all cases, we find im¬ 
proved agreement with the data compared to the full-sample 
comparison (Table 5). This is consistent with the expectation 
that the CLASH X-ray-selected subsample is largely com¬ 
posed of relaxed clusters (Section 6.2). For the Meneghetti 
et al. (2014) model, the agreement is particularly excellent, 
with a PTE of 0.760. 

Finally, we examine the two-dimensional c-M relations of 
Meneghetti et al. (2014) obtained from fitting X profiles of 
simulated halos (Section 6. 2). 30 As summarized in Table 5, 
we find a better agreement with the model that explicitly takes 
into account the CLASH selection function based on X-ray 
morphology (PTE = 0.730). 

In summary, we find that, overall, cluster c-M relations 
that are calibrated for recent cosmologies yield good agree¬ 
ment with the observations (Bhattacharya et al. 2013; Dutton 
& Maccio 2014; Meneghetti et al. 2014; Diemer & Kravtsov 
2015). An improved agreement is achieved when selection ef¬ 
fects are taken into account in the models (Duffy et al. 2008; 
Bhattacharya et al. 2013; Meneghetti et al. 2014), matching 
the characteristics of the CLASH clusters in terms of the over¬ 
all degree of relaxation and X-ray morphological regularity. 

7.4.2. Comparison with the SaWLens Results 

The observational c-M relation of Merten et al. (2015) 
is derived from their SaWLens analysis of a sample of 
19 CLASH X-ray-selected clusters (Section 4.1; Figure 11). 
Their sample includes all 16 clusters in our X-ray-selected 
subsample. Their NFW concentrations scale with halo mass 
and redshift as c oc jpf-o.32±o.i8 anc j c (x (i _|_ z )0.i4±o.52 

with a normalization of c| 2= o.37 = 3.66 ± 0.16 at M200C = 

30 We refer to Meneghetti et al. (2014) and Giocoli et al. (2012) for general 
discussions on the effects of projection bias in measuring the c-M relation 
from lensing. 


8 x 10 14 Mq hr 1 . Their slope is in good agreement with our 
results, while their normalization is somewhat lower than our 
measurement (Sections 6.1 and 6.2). The best-fit c-M rela¬ 
tion of Merten et al. (2015) is compared with our regression 
results in Figure 7. We see that the confidence regions overlap 
well at the 1<j level. The Merten et al. (2015) relation is also 
in agreement with the stacked lensing results of Umetsu et al. 
(2014) and those of this work. As discussed by Merten et al. 
(2015), their normalization is slightly lower than that of the 
Meneghetti et al. (2014) relation predicted for the CLASH- 
like X-ray-selected halos and in better agreement with that of 
the full-sample relation (including both relaxed and unrelaxed 
halos). 

A quantitative comparison of the Merten et al. (2015) c- 
M relation with our individual cluster measurements is sum¬ 
marized in Table 5. The observed-to-predicted concentration 
ratio has a small scatter of 0.209 around the mean value of 
1.133 ± 0.087 and shows an excellent goodness of fit, with 
PTE = 0.754. A direct comparison for the 16 clusters in 
common shows that our NFW concentrations (Table 2) are 
9.7% ± 10.3% higher than those obtained by Merten et al. 
(2015). We emphasize here that this agreement comes in spite 
of using substantially different reconstruction methods even 
though the data and sample used are largely common to the 
two studies. A multi-probe approach for cluster lensing is one 
of the great advantages of the CLASH survey, providing con¬ 
sistency checks between different lensing methods and dif¬ 
ferent data sets (Coe et al. 2012; Umetsu et al. 2012, 2014; 
Medezinski et al. 2013). 

7.4.3. Revisiting the Overconcentration Problem 

In contrast to the CLASH X-ray-selected subsample, clus¬ 
ters selected to have large Einstein radii (Ovsn) represent a 
highly biased population with their major axis preferentially 
aligned with the line of sight. A population of such super¬ 
lenses might also be biased toward halos with intrinsically 
higher concentrations (Hennawi et al. 2007). In the context of 
ACDM, the projected mass distributions of superlens clusters 
have ~ 40%-60% higher concentrations than typical clusters 
with similar masses and redshifts (Oguri & Blandford 2009). 

Prior to the CLASH survey, Umetsu et al. (2011a, hereafter 
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Figure 10. Joint constraints on the mass and concentration parameters (M20OC, C200c) for our CLASH X-ray-selected subsample f((c|)) ~ 0.34; blue contours) 
and the strong-lensing-selected sample of Umetsu et al. (201 la) ((( 21 )) — 0.32; gray contours) derived from the respective lensing analyses of //.ST-t-Subaru 
observations. For each case, the contours show the 68.3% and 95.4% confidence levels (A\ 2 = 2.3 and 6.17). The results are compared to theoretical c-M 
relations (solid lines) from numerical simulations of ACDM cosmologies (Duffy et al. 2008; Bhattacharya et al. 2013; Dutton & Maccio 2014; Meneghetti et al. 
2014; Diemer & Kravtsov 2015), all evaluated at 2 ~ 0.32 for the full population of halos. The dashed lines show 60% superlens corrections to the solid lines, 
accounting for the effects of selection and orientation bias expected for a strong-lensing cluster population (60%; Oguri & Blandford 2009). Once the effects of 
superlens bias are taken into account, the stacked-lensing constraints on the Umetsu et al. (201 la) sample come into line with the models of Dutton & Maccio 
(2014), Meneghetti et al. (2014), and Diemer & Kravtsov (2015), the three most recent c-M models studied in this work. 


Ull) performed a strong-lensing, weak-lensing shear and 
magnification analysis of four strong-lensing-selected clusters 
of similar masses (A1689, A1703, A370, and C10024+17) at 
(( z \)) ~ 0.32 using high-quality HST and Subaru observa¬ 
tions. These clusters display prominent strong-lensing fea¬ 
tures, characterized by (?Ein ~ 30" ( z s = 2). Ull show that 
the stacked ((E)) profile of the four clusters is well described 
by a single NFW profile in the one-halo regime, with an ef¬ 
fective concentration of c v j r = 7.68 ^q' 4 q (C 200 C — 6.29) 
at M vir = 22.0^4 x l0 14 M Q h?J (M 200c ~ 19.4 x 
10 14 Mq g 1 ), corresponding to an Einstein radius of O^in — 

36" (z. A = 2). Semianalytical simulations of ACDM incorpo¬ 
rating idealized triaxial halos yield a ~ 40%-60% bias correc¬ 
tion for a strong-lensing cluster population (Oguri & Bland¬ 
ford 2009). After applying a 50% superlens correction, Ull 
found a discrepancy of ~ 2a with respect to the Duffy et al. 
(2008) c-M relation based on the WMAP five-year cosmol¬ 
ogy. Ull conclude that there is no significant tension between 
the concentrations of their clusters and those of CDM halos if 
large lensing biases are coupled to a sizable intrinsic scatter 
in the c-M relation. 

Figure 10 compares in the c-M plane the stacked full- 
lensing constraints for our CLASH X-ray-selected subsample 
(blue contours; NFW+LSS (i) in Table 4) and those for the 
strong-lensing-selected sample of U11 (gray contours). In the 
figure we overplot theoretical c-M relations of Duffy et al. 
(2008), Bhattacharya et al. (2013), Dutton & Maccio (2014), 


Meneghetti et al. (2014), and Diemer & Kravtsov (2015, their 
mean relation; see Table 5), all evaluated for the full popu¬ 
lation of halos at the mean redshift (( z \)) ~ 0.32 of the Ull 
sample. Figure 10 demonstrates that c-M relations that are 
calibrated for more recent cosmologies and simulations pro¬ 
vide better agreement with our CLASH measurements (see 
Section 7.4.1). 

To account for the superlens bias in the Ull sample, we 
plot each of the c-M models with a maximal 60% correction 
(Oguri & Blandford 2009). We find that, once the effects of 
selection and orientation bias are taken into account, the full- 
lensing results of U11 come into line with the models of Dut¬ 
ton & Maccio (2014), Meneghetti et al. (2014), and Diemer 
& Kravtsov (2015), the three most recent c-M models stud¬ 
ied in this work. Hence, the discrepancy found by Ull can 
be fully reconciled by the higher normalization of the c-M 
relation as favored by recent WMAP and Planck cosmologies. 

8 . SUMMARY AND CONCLUSIONS 

We have presented a comprehensive analysis of strong- 
lensing, weak-lensing shear and magnification data for a sam¬ 
ple of 16 X-ray-selected and 4 high-magnification-selected 
galaxy clusters at 0.19 ;$ 2 0.69 (Table 1) targeted in the 

CLASH survey (Postman et al. 2012). Our analysis combines 
constraints from 16-band HST observations (Zitrin et al. 2015) 
and wide-field multi-color imaging (Umetsu et al. 2014) taken 
primarily with Subaru/Suprime-Cam, spanning a wide range 
of cluster radii, 6 = 10"-16'. We have carefully taken into ac- 
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count several major sources of uncertainties in our error anal¬ 
ysis (Section 3). 

We have reconstructed surface mass density profiles E of 
all clusters from a joint analysis of strong-lensing, weak- 
lensing shear and magnification constraints (Section 4; Fig¬ 
ure 1 1), providing a unique cluster mass profile data set. We 
find a significant improvement of ~ 45% on average in terms 
of the total S/N of the cluster mass profile measurement from 
adding the central HST lensing constraints (10"-40") to the 
wide-held weak-lensing data (0.9'-16')- 

With the improved sensitivity and resolution at 10"—40", 
we have measured masses and concentrations from these mass 
profiles for individual clusters assuming a spherical NFW 
halo (Tables 2 and 3). The median precision on individual 
cluster mass measurements is found to be ~ 28%, 24%, 23%, 
and 24% at A c = 200, 500,1000, and 2500, respectively. 
We assessed the internal consistency of cluster mass mea¬ 
surements across the multiple probes of cluster lensing ef¬ 
fects (Section 7.2). We find internal consistency of the en¬ 
semble mass calibration to be < 5% ± 6% in the one-halo 
regime, r = 200-2000kpc h~ l (0.01 r/r 2 oom ~ 1; Figure 
8), by comparison with the CLASH weak-lensing-only mea¬ 
surements of Umetsu et al. (2014). This level of uncertainty 
in the ensemble mass calibration, empirically estimated using 
different combinations of lensing probes, is in agreement with 
the systematic uncertainty in the absolute mass calibration of 
~ 6% assessed in Section 7.1. This implies that the total un¬ 
certainty in the absolute mass calibration with the sample of 
20 clusters is a/0.287 20 + 0.06 2 ~ 9% at A c = 200 (Sec¬ 
tion 4.2). 

The CLASH X-ray-selected sample was selected to have a 
high degree of regularity in their X-ray morphology (Postman 
et al. 2012). Numerical simulations suggest that this sam¬ 
ple is prevalently composed of relaxed clusters (~ 70%) and 
largely free of orientation bias (Meneghetti et al. 2014). An 
important effect of the CLASH selection function is to signif¬ 
icantly reduce the scatter in concentration because of their X- 
ray regularity (Section 6.2). For a lensing-unbiased subsam¬ 
ple of 16 CLASH X-ray-selected clusters, we have examined 
the mean c-M relation and its intrinsic scatter using Bayesian 
regression methods (Section 6; Figures 6 and 7). Our analy¬ 
sis takes into account the correlation between the errors on 
mass and concentration and the effects of nonuniformity of 
the intrinsic mass probability distribution. Our model yields 
a mean concentration of c| z= o .34 = 3.95 ± 0.35 at M 200 C = 
1O 15 M 0 ft -1 ~ 14 x 10 14 M q and an intrinsic scatter of 
cr(log 10 C 2 ooc) = 0.056±0.026, orcr(lnc 2 ooc) = 0.13±0.06. 
The normalization, slope, and scatter of the observed c-M re¬ 
lation are all consistent with ACDM predictions (Meneghetti 
et al. 2014) when the projection effects and the CLASH se¬ 
lection function based on X-ray morphological regularity are 
taken into account. Our regression results are in agreement 
with the SaWLens analysis of Merten et al. (2015) and the 
stacked shear-only analysis of Umetsu et al. (2014). This 
multi-probe approach for cluster lensing is one of the key ad¬ 
vantages of the CLASH survey, providing consistency checks 
between different lensing methods and different data sets. 

We have derived an ensemble-averaged surface mass den¬ 
sity profile ((E)) at an average redshift of ((z\)) — 0.34 for 
the X-ray-selected subsample of 16 clusters by stacking their 
individual E profiles (Section 5; Figure 3). The stacked lens¬ 
ing signal is detected at 33cr significance over the entire radial 
range, R < 4000 kpc /i _1 , accounting for the effects of in¬ 


trinsic profile variations and uncorrelated LSS along the line 
of sight (Figure 1). 

Our CLASH lensing determination of the cluster mass dis¬ 
tribution provides a firm basis for a detailed comparison with 
theoretical models. We show that the ((E)) profile is well 
described by a family of density profiles predicted for DM- 
dominated halos in gravitational equilibrium (Table 4; Fig¬ 
ure 4), namely, the NFW, Einasto, and DARKexp models. 
Of these, the first two are phenomenological models and 
the last is a theoretically derived model (Hjorth & Williams 
2010; Williams & Hjorth 2010). The single power-law, cored 
isothermal and Burkert density profiles are disfavored by the 
observed mass profile having a pronounced radial curvature. 

We find that cuspy halo models that include the large-scale 
two-halo contribution using the b^-M relation of Tinker et al. 
(2010) provide improved agreement with the data. Indepen¬ 
dent of the chosen halo density profile, we find 7 (M-jooc) ~ 
9.3 (6 h cr| ~ 6.1). For the NFW halo model (NFW+LSS (i)), 
we measure a mean concentration of C 200 C = 3.791q 28 at 
M 200 C = 14.11) qX 10 14 Mq , demonstrating consistency 

between complementary analysis methods (Figure 7). This 
model yields an Einstein radius of 0Ei n = M.OjHj'l arcsec at 
z s = 2, which agrees within 2<r with the observed median 
Einstein radius of $Ein = 20.1" (z s = 2) for this subsample 
(Section 3.5). The Einasto shape parameter is constrained to 
be ciE = 0.248^0^047 (l/ a E = 4.03+o gg), which is in good 
agreement with predictions from ACDM numerical simula¬ 
tions (Gao et al. 2008; Meneghetti et al. 2014). 

A systematic comparison between different cluster lensing 
surveys that use different approaches to measuring the masses 
of clusters allows us to identify (known and unknown) sys¬ 
tematic effects. In the last few years, a substantial effort 
has been devoted to establishing an accurate mass calibration 
from cluster lensing (von der Linden et al. 2014a; Hoekstra 
et al. 2015; Israel et al. 2015), in light of the apparent ten¬ 
sion in cosmological constraints from Planck primary CMB 
data and SZE cluster counts (Planck Collaboration et al. 2014, 
2015). We compared our CLASH lensing masses (Tables 2 
and 3) with weak-lensing masses from other surveys (WtG, 
CCCP, LoCuSS). Our mass measurements are in excellent 
agreement within ler with the WtG (Applegate et al. 2014) 
and LoCuSS (Okabe & Smith 2015) surveys, with which we 
have 17 and 5 clusters in common, respectively. At higher 
mass overdensities A c >, 500 where weak-lensing measure¬ 
ments are subject to various systematics, the agreement ap¬ 
pears to be less impressive (Sections 7.3.2 and 7.3.4). We find 
that our measurements are on average 16% and 9% higher 
than the CCCP measurements at A c = 500 and 2500, respec¬ 
tively. With limited statistics, however, we do not find statis¬ 
tically significant differences between the CCCP and CLASH 
results. 

Our mass measurements are found to be ~ 7%-9% higher 
than the CLASH SaWLens results of Merten et al. (2015), 
with which we have 16 clusters in common. This difference is 
smaller than the systematic mass offset of ~ 10% ± 5% found 
between the CLASH weak-lensing (Umetsu et al. 2014) and 
SaWLens (Merten et al. 2015) mass profiles. Hence, com¬ 
bining the central //5T-lensing and outer weak-lensing data 
has reduced the discrepancy with respect to the SaWLens 
results. We find that this remaining discrepancy can be at¬ 
tributed to three outliers (Section 4.1), which correspond to 
clusters at the lowest Galactic latitudes (b < 30°) of the 
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overlap sample and those at the highest redshifts (z A 0.45) 
which exhibit complex mass distributions with a high degree 
of substructure (Postman et al. 2012; Merten et al. 2015). 
Since the data used are largely common to the two analyses 
except for the inclusion of weak-lensing magnification data 
in this work, this discrepancy likely arises from systematics 
in the present calibration of magnification measurements for 
these low Galactic latitude clusters and high redshift clusters. 
Weak lensing reconstructions are sensitive to the treatment 
of boundary conditions if there are massive structures near 
the data boundaries. Hence, mass profile reconstructions for 
clusters with high degrees of substructure can be subject to a 
greater degree of mass-sheet degeneracy. Excluding the three 
outliers brings the two results into agreement within 1% at 
A c = A v j r , 200, and 500 and within 5% at A c = 2500. 

In the CLASH survey (Postman et al. 2012), we have 
demonstrated the power of multi-probe, multi-scale data sets 
available from a space telescope cluster survey (Zitrin et al. 
2015; Merten et al. 2015) combined with X-ray (Donahue 
et al. 2014), SZE (Sayers et al. 2013; Czakon et al. 2015), and 
wide-held imaging plus spectroscopic (Umetsu et al. 2014; 
Rosati et al. 2014) follow-up observations. Extending this 
type of cluster survey to a large sky area, as planned for the 
WFIRST and Euclid missions, will be a significant step for¬ 
ward in obtaining a comprehensive picture of the evolution of 
clusters over cosmic time and across populations, as well as in 
understanding the evolutionary and tidal effects of surround¬ 


ing LSS on the mass distribution of the central cluster. 
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APPENDIX 

A. DISCRETIZED EXPRESSIONS FOR CLUSTER LENSING PROFILES 

First, we derive a discrete expression for the mean interior convergence k oc (< 9) as a function of clustercentric radius 9 using 
the azimuthally averaged convergence {()). For a given set of (N + 1) concentric radii 0, defining N radial bands in the range 
0 min = 9i < 9 < 0jv+i = 0 max , a discretized estimator for /£<*-,(< 9) can be written as 


ft'oo @i) 


min 

~e~ 


2 

$min) H” 


9 i ~ 1 

02 51A In 0J0J-KOO (<?,,■). 

1 3 =1 


(Al) 


with Ain 9i = {6 l+ \ — and 9i the area-weighted center of the zth bin defined by [9 i: 0 i+1 ], In the continuous limit. 


9i= 2 
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2 Oh i - 0? 

3 0 2 +1 -9l 


Next, we derive discretized expressions for the tangential reduced shear g+{9) and the inverse magnification // 1 (0) in terms 
of the binned convergence n 00 {6i), using the following relations: 


,-Q \_ (W)g [ K oo(< @i) ^oo(^z)] 

9+ i _ i-k 9 (n«oo(«.) ’ 

= [! - {W) l _ l K oc (0i)] 2 - (W)l [koo(< 0i) - kM)} 2 , 


(A2) 

(A3) 


where both depend on the mean convergence interior to 9i, «oo(< A,)- By assuming a constant density in each radial band, we 
find the following expression for Koo(< 9i ): 


^oo ^2) 2 


l r 


iPi/^i) tCoo(< 0i) + [di+l/9j) K 00 (< ^i+l) 


(A4) 


where Koo(< $i) and Noo(< $i+i) can be computed using Equation (Al). 

Accordingly, all relevant cluster lensing observables can be uniquely specified by the binned convergence profile 

1 with /too,min = ^min ) and = ^oo{,9 i) • 
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B. COMPARISON OF SURFACE MASS DENSITY PROFILES 


Appendix B includes the surface mass density profiles for our cluster sample obtained in this study, along with those of Umetsu 
et al. (2014) and Merten et al. (2015). 
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Figure 11. Surface mass density profiles derived from a joint analysis of HST strong/weak-shear lensing and ground-based weak shear/magnification lensing 
data (black squares) we have obtained for a sample of (a), (b) 16 X-ray-regular and (c) 4 high-magnification clusters selected from the CLASH survey. For 
each cluster, the central bin E(< 0 m in) is marked with a horizontal bar. The location of each binned E point represents the area-weighted center of the radial 
band (Appendix A). The error bars represent the lcr uncertainty from the diagonal part of the total covariance matrix including statistical, systematic, projected 
uncorrelated LSS, and intrinsic-variance contributions, C = C stat + C sys + C lss + C int (Section 3.6.3). The gray area in each plot shows the best-fit NFW 
profile (68% CL) for the observed E profile. The results are compared to the shear+magnification results (blue circles) of Umetsu et al. (2014) and those from 
a SaWLens (red dots) analysis of Merten et al. (2015). The scale on the right vertical axis shows the corresponding lensing convergence acoo scaled to the 
reference far-background source plane. 

































































































































28 


CLASH: Strong-lensing, Weak-lensing Shear and Magnification Analysis 


R [h kpc] 

20 50 100 200 500 1000 2000 



8 [arcsec] 

R [/f 1 kpc] 

a20 50 100 200 500 1000 2000 


It 


f 


MACSJ1720.3+3536 




5 10 20 100 200 500 1000 

6 [arcsec] 

R [/? —1 kpc] 

ie 20 50 100 200 500 1000 2000 


ft I 


MACSJ1206.2-0847 


fit 


5 10 20 100 200 500 1000 

8 [arcsec] 

R [/f 1 kpc] 

6 50 100 200 500 1000 2000 


Ifesa- t t RXJ1347.5-1145 

* iii 


I 


i 


5 10 20 100 200 500 1000 

8 [arcsec] 


R [h kpc] 

; 20 50 100 200 500 1000 2000 


£ 

-c 

H 



8 [arcsec] 

R [/f 1 kpc] 

6 20 50 100 200 500 1000 2000 



8 [arcsec] 

R [/f 1 kpc] 

50 100 200 500 1000 2000 


■Hii 


MACSJ0329.7-0211 


ft l 


5 10 20 100 200 500 1000 

8 [arcsec] 

R [/f 1 kpc] 

6 50 100 200 500 1000 2000 5000 


MACSJ0744.9+3927 




5 10 20 100 200 500 1000 

8 [arcsec] 


Figure 11. (Continued.) 
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